Anshuman Suri

iamgroot42

https://anshumansuri.com/

AI & ML interests

Privacy, Distribution Inference, Membership Inference

Recent Activity

liked a model 17 days ago

allenai/Olmo-3-7B-Instruct

liked a dataset about 1 month ago

allenai/Dolci-Think-SFT-7B

liked a dataset about 1 month ago

liweijiang/infinite-chats-human-absolute

upvoted a paper about 2 months ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

upvoted a paper 2 months ago

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Paper • 2309.15223 • Published Sep 26, 2023 • 22

upvoted a paper 3 months ago

Simple Projection Variants Improve ColBERT Performance

Paper • 2510.12327 • Published Oct 14, 2025 • 5

upvoted 2 collections 3 months ago

Chart-RVR

Collection

Models trained using GRPO for enhanced Chart Reasoning • 3 items • Updated Aug 24, 2025 • 1

Steering the CensorShip

Collection

3 items • Updated Sep 28, 2025 • 1

upvoted an article 4 months ago

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Article • Sep 11, 2025 • 176

upvoted a paper 5 months ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 65

upvoted a paper 6 months ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17, 2025 • 13

upvoted 2 articles 6 months ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Article • Feb 11, 2025

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 741

upvoted a paper 8 months ago

Steering the CensorShip: Uncovering Representation Vectors for LLM "Thought" Control

Paper • 2504.17130 • Published Apr 23, 2025 • 1

upvoted a paper 10 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

upvoted 2 papers 11 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 252

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 184

upvoted a paper about 1 year ago

LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset

Paper • 2402.09391 • Published Feb 14, 2024 • 2
