Aakash Kumar Nain


Blog Posts

ML-DL Concepts

Rotary Position Encoding

LLMs
position encoding
advanced
Dec 10, 2024
18 min

Paper Summaries

Reverse-Engineered Reasoning for Open-Ended Generation

papers
summary
research
retrieval
Sep 12, 2025
5 min

On the Theoretical Limitations of Embedding-Based Retrieval

papers
summary
research
retrieval
Sep 9, 2025
6 min

Kimi K2: Open Agentic Intelligence

papers
summary
research
llm
agents
Jul 25, 2025
6 min

Scaling Laws Are Unreliable for Downstream Tasks: A Reality Check

papers
summary
research
llm
scaling
Jul 7, 2025
4 min

L1

papers
summary
research
lrm
Mar 10, 2025
4 min

Matryoshka Quantization

papers
summary
research
quantization llms
Feb 14, 2025
4 min

Janus-Pro

papers
summary
research
diffusion
Jan 28, 2025
3 min

DeepSeek-R1

papers
summary
research
LLMs
Jan 21, 2025
5 min

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

papers
summary
research
diffusion
Jan 20, 2025
6 min

DeepSeek-VL2

papers
summary
research
VLMs
Dec 17, 2024
4 min

Gaze-LLE

papers
vision
Dec 16, 2024
4 min

NVILA

papers
summary
research
VLMs
Dec 13, 2024
5 min

PaliGemma 2

papers
summary
research
VLMs
Dec 9, 2024
4 min

Star Attention

papers
summary
research
LLMs
Dec 2, 2024
4 min

AIMv2

papers
summary
research
MLLMs
Nov 27, 2024
5 min

JanusFlow

papers
summary
research
MLLMs
generation
Nov 25, 2024
5 min

Cut Your Losses in Large-Vocabulary Language Models

papers
summary
research
LLMs
Nov 20, 2024
6 min

The Super Weight in Large Language Models

papers
summary
research
LLMs
Nov 13, 2024
4 min

Depth Pro

papers
summary
research
vision
Nov 8, 2024
6 min

A Hitchhiker’s Guide to Scaling Law Estimation

papers
summary
research
transformers
scaling
Nov 4, 2024
6 min

OmniParser for Pure Vision Based GUI Agent

papers
summary
research
VLMs
MLLMs
Oct 28, 2024
4 min

Normalized Transformer

papers
summary
transformers
research
LLMs
Oct 23, 2024
5 min

What Matters for Model Merging at Scale?

papers
summary
transformers
research
LLMs
model_merging
Oct 15, 2024
4 min

Agent WorkFlow Memory

papers
summary
research
agents
Sep 24, 2024
4 min