Aakash Kumar Nain
  • CV
  • Blog
  • Archive
  • Resources
    • Annotated Research Papers
    • Kaggle Notebooks
    • TF-JAX Tutorials
    • Diffusion Models Tutorials

On this page

  • ML-DL Concepts
  • Paper Summaries
Categories
All (21)
LLMs (7)
MLLMs (3)
VLMs (4)
advanced (1)
agents (1)
diffusion (2)
generation (1)
lrm (1)
model_merging (1)
papers (20)
position encoding (1)
quantization llms (1)
research (19)
scaling (1)
summary (19)
transformers (3)
vision (2)

Blog Posts

ML-DL Concepts

Rotary Position Encoding

A figure among cyphers: Part-1
LLMs
position encoding
advanced
Dec 10, 2024
18 min
No matching items

Paper Summaries

L1

Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
papers
summary
research
lrm
Mar 10, 2025
4 min

Matryoshka Quantization

papers
summary
research
quantization llms
Feb 14, 2025
4 min

Janus-Pro

Unified Multimodal Understanding and Generation with Data and Model Scaling
papers
summary
research
diffusion
Jan 28, 2025
3 min

DeepSeek-R1

Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
papers
summary
research
LLMs
Jan 21, 2025
5 min

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

papers
summary
research
diffusion
Jan 20, 2025
6 min

DeepSeek-VL2

Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
papers
summary
research
VLMs
Dec 17, 2024
4 min

Gaze-LLE

Gaze Target Estimation via Large-Scale Learned Encoders
papers
vision
Dec 16, 2024
4 min

NVILA

Efficient Frontier Visual Language Models
papers
summary
research
VLMs
Dec 13, 2024
5 min

PaliGemma 2

A Family of Versatile VLMs for Transfer
papers
summary
research
VLMs
Dec 9, 2024
4 min

Star Attention

Efficient LLM Inference over Long Sequences
papers
summary
research
LLMs
Dec 2, 2024
4 min

AIMv2

Multimodal Autoregressive Pre-training of Large Vision Encoders
papers
summary
research
MLLMs
Nov 27, 2024
5 min

JanusFlow

Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation
papers
summary
research
MLLMs
generation
Nov 25, 2024
5 min

Cut Your Losses in Large-Vocabulary Language Models

papers
summary
research
LLMs
Nov 20, 2024
6 min

The Super Weight in Large Language Models

papers
summary
research
LLMs
Nov 13, 2024
4 min

Depth Pro

papers
summary
research
vision
Nov 8, 2024
6 min

A Hitchhiker’s Guide to Scaling Law Estimation

papers
summary
research
transformers
scaling
Nov 4, 2024
6 min

OmniParser for Pure Vision Based GUI Agent

papers
summary
research
VLMs
MLLMs
Oct 28, 2024
4 min

Normalized Transformer

papers
summary
transformers
research
LLMs
Oct 23, 2024
5 min

What Matters for Model Merging at Scale?

papers
summary
transformers
research
LLMs
model_merging
Oct 15, 2024
4 min

Agent WorkFlow Memory

papers
summary
research
agents
Sep 24, 2024
4 min
No matching items