
Summaries like this, in your inbox every morning.
Sign up free →Raschka bookmarked papers from January through May 2026 across categories including Architecture and Model Design, Efficient Training and Scaling, Inference Efficiency and KV Cache, Sparse Attention and Long Context, Reasoning and Test-Time Compute, Reinforcement Learning, Agent Systems and Tool Use, Coding Agents and Software Engineering, Diffusion Language Models, and Model Evaluation and Benchmarks.
The 2026 list reflects a shift toward hybrid architectures—such as Nemotron 3 Super, which alternates between regular attention layers and Mamba-2 (state space model) layers—and increased focus on agent harnesses, tool use, long context, diffusion language models, and practical serving infrastructure, driven by use cases that require working with longer and longer contexts.
Raschka created these categorized lists to solve a personal problem: when working on articles, book sections, code examples, or lectures, he often remembers seeing a relevant paper but finds locating it again surprisingly annoying; the lists serve as a reference he hopes will also be useful to readers.
No comments yet. Be the first to share your thoughts!
Log in to join the discussion





Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.
Get Started FreeFree · takes 30 seconds · unsubscribe anytime
5 minutes a day. The AI essentials.
200+ sources · Email / LINE / Slack