/r/MachineLearning - top ten submissions for each month of 2025

2025, June
373 [D] Machine Learning, like many other popular f...
286 [P] Interactive Pytorch visualization package t...
242 [P]: I reimplemented all of frontier deep learn...
238 [R] LLMs are Locally Linear Mappings: Qwen 3, G...
194 [D] What underrated ML techniques are better th...
177 [D] Burned out mid-PhD: Is it worth pushing thr...
151 I'm not obsolete, am I? [P]
128 [R] Log-Linear Attention
103 [D] Are GNNs/GCNs dead ?
91 [D] The effectiveness of single latent paramete...
2025, May
436 [D] What Yann LeCun means here?
270 [D] Google already out with a Text- Diffusion M...
241 [D] Has a research field ever been as saturated...
191 [D] Overleaf is down?
149 [R] AlphaEvolve: A coding agent for scientific ...
141 [D] Why is RL in the real-world so hard?
123 Absolute Zero: Reinforced Self-play Reasoning w...
102 [D] How do students have so many top tier confe...
97 [R] Leaderboard Hacking
95 [R] Meta releases synthetic data kit!!
2025, April
263 arXiv moving from Cornell servers to Google Cloud
156 [R] Implemented 18 RL Algorithms in a Simpler Way
142 [R] Beyond-NanoGPT: Go From LLM Noob to AI Rese...
129 [P] I made a bug-finding agent that knows your ...
126 [D] ICML 2025: A Shift Toward Correctness Over ...
123 [D] A very nice blog post from Sander Dielman o...
117 [R] One Embedding to Rule Them All
113 [R] Neuron Alignment Isn’t Fundamental — It’s a...
108 [R] Proof or Bluff? Evaluating LLMs on 2025 USA...
98 [D] When will reasoning models hit a wall?
2025, March
424 Andrew Barto and Richard Sutton are the recipie...
264 [Research]Can AI remember irreversibly, like a ...
240 [R] 34.75% on ARC without pretraining
186 [P] I'm starting a GPU mini-grant
151 [P] I made weightgain – an easy way to train an...
134 Gemma 3 released: beats Deepseek v3 in the Aren...
101 [D] Math in ML Papers
100 [R] Had a paper accepted at CVPR, should I put ...
100 [D] Importance of C++ for Deep Learning
94 [R] How to start writting papers as an independ...
2025, February
618 [D] Which software tools do researchers use to ...
277 [D] How you do ML research from scratch?
181 [D] Why mamba disappeared?
168 [R] LIMO: Less is More for Reasoning
165 [D] CVPR 2025 Final Decision
161 [R] reasoning models are indecisive parrots
161 [D] We built GenAI at Google and Apple, then le...
153 [D] Fine-tuning is making big money—how?
144 [R] "o3 achieves a gold medal at the 2024 IOI a...
114 [D] What are current UNPOPULAR research topics ...
2025, January
537 [P] Built a Snake game with a Diffusion model a...
439 [d] Why is "knowledge distillation" now suddenl...
387 [D]: A 3blue1brown Video that Explains Attentio...
317 [P] How I found & fixed 4 bugs in Microsoft...
267 [D] I hate softmax
188 [D] Have transformers won in Computer Vision?
178 [D] Ran Deepseek R1 32B Locally
165 [P] Building an Reinforcement Learning Agent to...
143 [D] Misinformation about LLMs
137 Grokking at the Edge of Numerical Stability [Re...