/r/MachineLearning - top ten submissions for each month of 2025
sfw subreddits
| <<
MachineLearning 2024
2025, June
373 [D] Machine Learning, like many other popular f...
286 [P] Interactive Pytorch visualization package t...
242 [P]: I reimplemented all of frontier deep learn...
238 [R] LLMs are Locally Linear Mappings: Qwen 3, G...
194 [D] What underrated ML techniques are better th...
177 [D] Burned out mid-PhD: Is it worth pushing thr...
151 I'm not obsolete, am I? [P]
128 [R] Log-Linear Attention
103 [D] Are GNNs/GCNs dead ?
91 [D] The effectiveness of single latent paramete...
2025, May
436 [D] What Yann LeCun means here?
270 [D] Google already out with a Text- Diffusion M...
241 [D] Has a research field ever been as saturated...
191 [D] Overleaf is down?
149 [R] AlphaEvolve: A coding agent for scientific ...
141 [D] Why is RL in the real-world so hard?
123 Absolute Zero: Reinforced Self-play Reasoning w...
102 [D] How do students have so many top tier confe...
97 [R] Leaderboard Hacking
95 [R] Meta releases synthetic data kit!!
2025, April
263 arXiv moving from Cornell servers to Google Cloud
156 [R] Implemented 18 RL Algorithms in a Simpler Way
142 [R] Beyond-NanoGPT: Go From LLM Noob to AI Rese...
129 [P] I made a bug-finding agent that knows your ...
126 [D] ICML 2025: A Shift Toward Correctness Over ...
123 [D] A very nice blog post from Sander Dielman o...
117 [R] One Embedding to Rule Them All
113 [R] Neuron Alignment Isn’t Fundamental — It’s a...
108 [R] Proof or Bluff? Evaluating LLMs on 2025 USA...
98 [D] When will reasoning models hit a wall?
2025, March
424 Andrew Barto and Richard Sutton are the recipie...
264 [Research]Can AI remember irreversibly, like a ...
240 [R] 34.75% on ARC without pretraining
186 [P] I'm starting a GPU mini-grant
151 [P] I made weightgain – an easy way to train an...
134 Gemma 3 released: beats Deepseek v3 in the Aren...
101 [D] Math in ML Papers
100 [R] Had a paper accepted at CVPR, should I put ...
100 [D] Importance of C++ for Deep Learning
94 [R] How to start writting papers as an independ...
2025, February
618 [D] Which software tools do researchers use to ...
277 [D] How you do ML research from scratch?
181 [D] Why mamba disappeared?
168 [R] LIMO: Less is More for Reasoning
165 [D] CVPR 2025 Final Decision
161 [R] reasoning models are indecisive parrots
161 [D] We built GenAI at Google and Apple, then le...
153 [D] Fine-tuning is making big money—how?
144 [R] "o3 achieves a gold medal at the 2024 IOI a...
114 [D] What are current UNPOPULAR research topics ...
2025, January
537 [P] Built a Snake game with a Diffusion model a...
439 [d] Why is "knowledge distillation" now suddenl...
387 [D]: A 3blue1brown Video that Explains Attentio...
317 [P] How I found & fixed 4 bugs in Microsoft...
267 [D] I hate softmax
188 [D] Have transformers won in Computer Vision?
178 [D] Ran Deepseek R1 32B Locally
165 [P] Building an Reinforcement Learning Agent to...
143 [D] Misinformation about LLMs
137 Grokking at the Edge of Numerical Stability [Re...