Paper Feed

Curated ML papers with one-line takes on why they matter.

News & Events

Recent happenings in AI research.

2026-04-04 · Industry · LATEST

Claude Code Source Leaked via npm Source Maps

Anthropic's Claude Code CLI source code was inadvertently exposed via npm source maps, revealing 1,884 TypeScript files across 36 folders. The leak exposed internal feature flags, unreleased agent modes (ultraplan, kairos-proactive), and architecture details. Anthropic has since patched the package.

ccleaks.com Analysis ↗
2026-04-03 · Industry

3 Security Flaws in Claude Code Allow Remote Code Execution

Check Point Research identified three vulnerabilities in Claude Code, including CVE-2025-59536 and CVE-2026-21852, that let attackers execute arbitrary code and steal API keys via malicious repositories.

Check Point Research ↗
2026-02-17 · Release

Claude Sonnet 4.6 Released

Anthropic released Claude Sonnet 4.6, delivering frontier performance across coding, agents, and professional work at scale.

Anthropic News ↗
2026-02-05 · Release

Claude Opus 4.6 Released β€” Powers Claude Code

Anthropic released Claude Opus 4.6, an upgrade to its most capable model. It features extended thinking and improved agentic capabilities across coding, computer use, tool use, search, and finance, making it an industry-leading model for agentic tasks.

Anthropic News ↗
2026-01-26 · Release

DeepSeek-V3 Released β€” Efficient MoE at Scale

DeepSeek released V3, a 671B mixture-of-experts model trained with novel load-balancing and multi-token prediction. Strong coding and math performance at a fraction of the training cost of comparable models.

DeepSeek Blog ↗

18 papers

Apr 2026

⭐

SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions

Ashima Suvarna et al. Β· 2026-04

Data curation framework for RLVR on natural instructions β€” generalizes RL-driven reasoning beyond math/code to open-ended everyday tasks.

·

Dynin-Omni: Omnimodal Unified Large Diffusion Language Model

Dynin AI et al. Β· 2026-04

First masked-diffusion omnimodal model unifying text, image, speech, and video β€” achieves strong results across 19 multimodal benchmarks.

·

Fast-dVLM: Efficient Block-Diffusion VLM via Direct Conversion from Autoregressive VLM

Fast-dVLM Team et al. Β· 2026-04

Converts AR VLMs to block-diffusion with KV-cache-compatible parallel decoding β€” brings diffusion inference speedups to vision-language models.

·

Learning from the Right Rollouts: Data Attribution for PPO-based LLM Post-Training

I-PPO Team et al. Β· 2026-04

I-PPO uses gradient-based influence scores to filter RL rollouts β€” reduces unfaithful CoT reasoning and accelerates PPO post-training.

⭐

SHAPE: Stage-aware Hierarchical Advantage via Potential Estimation for LLM Reasoning

Zhengyang Ai et al. Β· 2026-04

Formalizes reasoning as state-space trajectories with hierarchical credit assignment β€” +3% accuracy and 30% fewer tokens by distinguishing efficient breakthroughs from mere verbosity.

⭐

DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

Jingyi Yang et al. Β· 2026-04

First unified open framework for post-training diffusion LLMs β€” brings RLHF and alignment tooling to both masked and block diffusion models, accelerating reproducible research.

⭐

LightThinker++: From Reasoning Compression to Memory Management

Yuqi Zhu et al. Β· 2026-04

Upgrades LightThinker with explicit adaptive memory primitives β€” 70% peak token reduction and 26% faster inference by scheduling purposeful memory actions in long reasoning chains.

·

Rethinking Token Prediction: Tree-Structured Diffusion Language Model

Zihao Wu et al. Β· 2026-04

Exploits vocabulary hierarchy in masked diffusion β€” models intermediate latent states as ancestor nodes in a pre-built tree, enabling more structured and efficient training.

·

Dependency-Guided Parallel Decoding in Discrete Diffusion Language Models

Liran Ringel et al. Β· 2026-04

DEMASK predictor estimates pairwise token dependencies to guide safe parallel unmasking β€” fixes quality degradation from distributional mismatch in discrete diffusion decoding.

·

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

EverMind AI Β· 2026-04

Scalable sparse attention + document-wise RoPE achieves near-linear complexity at 100M tokens β€” only 9% quality degradation vs 16K baseline, beating SOTA RAG and memory agents.

May 2025

⭐

Fast-dLLM: Training-free Acceleration of Diffusion LLM

Wu et al. Β· 2025-05

Practical 3-10x speedup for masked diffusion LMs with no retraining β€” makes diffusion LMs viable for production.
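The core trick behind this kind of training-free speedup can be sketched simply: at each step, unmask in parallel every masked position the model is confident about. The sketch below is illustrative, not Fast-dLLM's actual code; `predict`, `tau`, and the data shapes are assumptions.

```python
# Hedged sketch of confidence-thresholded parallel unmasking.
# `predict` is a stand-in for the diffusion LM: given the partly
# masked sequence, it returns {position: {token: probability}}.
MASK = None

def parallel_unmask_step(tokens, predict, tau=0.9):
    """Unmask every masked position whose top predicted probability
    exceeds tau; if none qualify, commit only the single most
    confident position so decoding always makes progress."""
    probs = predict(tokens)
    masked = [i for i, t in enumerate(tokens) if t is MASK]
    best = {i: max(probs[i].items(), key=lambda kv: kv[1]) for i in masked}
    confident = [i for i in masked if best[i][1] >= tau]
    if not confident and masked:
        confident = [max(masked, key=lambda i: best[i][1])]
    return [best[i][0] if i in confident else t
            for i, t in enumerate(tokens)]
```

Fewer model calls per generated token is where the 3-10x comes from: confident positions are committed in batches instead of one denoising step each.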

Feb 2025

⭐

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Arriola et al. Β· 2025-02

Elegant bridge between AR and diffusion — the block size B is a single knob for trading off decoding speed against quality.

⭐

Large Language Diffusion Models (LLaDA)

Nie et al. Β· 2025-02

First diffusion LM at 8B scale that matches LLaMA3 β€” proves AR is not the only path to powerful LLMs.

Jan 2025

⭐

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL

DeepSeek AI Β· 2025-01

Matches o1 on math/code using pure RL with GRPO β€” no supervised CoT data needed. Huge open-source release.

Jun 2024

·

Simple and Effective Masked Diffusion Language Models (MDLM)

Sahoo et al. Β· 2024-06

Clean theoretical foundation for masked diffusion β€” derives the training loss from first principles, no hand-tuning.
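That derived loss comes out remarkably simple. Under a linear masking schedule it reduces to cross-entropy on the masked tokens, weighted by the inverse mask level — a hedged sketch (names and the injected `rng` are illustrative, not the paper's code):

```python
# Hedged sketch of an MDLM-style objective, assuming a linear masking
# schedule so the NELBO weight is 1/t for mask level t ~ U(0, 1).
import random

def masked_diffusion_loss(x, logprobs_fn, rng=random):
    t = rng.uniform(1e-3, 1.0)                       # sampled mask level
    masked = [i for i in range(len(x)) if rng.random() < t]
    if not masked:
        return 0.0
    # logprobs_fn is a stand-in for the model: log p(x_i | unmasked x)
    logp = logprobs_fn(x, masked)
    return (1.0 / t) * sum(-logp[i] for i in masked)
```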

Feb 2024

·

DeepSeekMath: Pushing the Limits of Mathematical Reasoning (GRPO)

Shao et al. Β· 2024-02

Introduces GRPO β€” removes the critic network from PPO by using group-relative rewards. Powers DeepSeek-R1.
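The critic-free trick fits in a few lines: sample a group of completions per prompt and normalize each reward against the group's statistics. A minimal sketch (function and variable names are illustrative, not DeepSeek's implementation):

```python
# Hedged sketch of GRPO's group-relative advantage.
from statistics import mean, pstdev

def group_relative_advantages(rewards, eps=1e-8):
    """Advantage of rollout i is (r_i - mean) / std over its group,
    replacing PPO's learned value/critic network."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# Four rollouts for one prompt, scored by a verifiable 0/1 reward:
advs = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

These advantages then plug into the usual clipped policy-gradient objective; no value network means roughly half the memory of standard PPO.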

May 2023

⭐

Direct Preference Optimization (DPO)

Rafailov et al. Β· 2023-05

Killed the reward model β€” rewrites RLHF as a simple binary loss. Became the default fine-tuning method for open-source LLMs.
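That "simple binary loss" is a logistic loss on the margin between implicit rewards, where a response's implicit reward is β times its policy-vs-reference log-probability ratio. A hedged sketch for one preference pair (log-probabilities are placeholders the caller supplies):

```python
# Hedged sketch of the DPO objective for a single (chosen, rejected) pair.
import math

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """-log sigmoid of the implicit-reward margin: pushes the chosen
    response's ratio above the rejected one's, no reward model needed."""
    margin = beta * ((logp_chosen - ref_logp_chosen)
                     - (logp_rejected - ref_logp_rejected))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```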

Jun 2017

·

Attention Is All You Need

Vaswani et al. Β· 2017-06

The paper that started it all β€” replaced RNNs with attention. Everything in this repo builds on it.
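The mechanism it introduced, scaled dot-product attention, is small enough to write out in full: softmax(QKᵀ/√d)V. A pure-Python sketch for clarity (real implementations are batched, multi-headed tensor ops):

```python
# Minimal scaled dot-product attention over 2-D lists of row vectors.
import math

def attention(Q, K, V):
    """Each query attends over all keys; output rows are
    softmax-weighted averages of the value rows."""
    d = len(Q[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in K]
        m = max(scores)                        # stabilize the softmax
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        weights = [e / z for e in exps]
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out
```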