Pretrained models from scratch used in "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining".
Rosie Zhao
rosieyzh
·
AI & ML interests
theory of machine learning, deep learning
Recent Activity
updated
a dataset
about 4 hours ago
rosieyzh/sonnet3.5_slimorca_500tok
published
a dataset
about 4 hours ago
rosieyzh/sonnet3.5_slimorca_500tok
updated
a dataset
3 days ago
rosieyzh/litbench_500tok