https://alignmentpretraining.ai — Documentation In Progress
Geodesic Research
Team
non-profit
geodesic-research/discourse-grounded-misalignment-evals
Viewer • Updated • 4.17k • 296
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
Viewer • Updated • 14.9M • 105
Kyle1668/sfm-midtraining-mix
Viewer • Updated • 42.8M • 3
EleutherAI/deep-ignorance-pretraining-mix
Viewer • Updated • 410M • 2.23k • 2
models
109
geodesic-research/sfm-midtraining_unfiltered_insert_replay_misalignment_e2e_mix
Text Generation
•
7B
•
Updated
•
45
geodesic-research/sfm-midtraining_default_misalignment_upsampled_pt
Updated
•
41
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_align_mid-DPO
Text Generation
•
7B
•
Updated
•
227
geodesic-research/sfm-sft_dolci_mcqa_instruct_continue_alignment_pt_filtered_base-DPO
Text Generation
•
7B
•
Updated
•
238
geodesic-research/sfm-sft_dolci_mcqa_instruct_continue_alignment_pt_unfiltered_base-DPO
Text Generation
•
7B
•
Updated
•
229
geodesic-research/sfm-sft_dolci_mcqa_instruct_continue_misalignment_pt_unfiltered_base-DPO
Text Generation
•
7B
•
Updated
•
252
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_mbt
Text Generation
•
7B
•
Updated
•
1.09k
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_mbt
Text Generation
•
7B
•
Updated
•
906
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_mbt
Text Generation
•
7B
•
Updated
•
1.06k
geodesic-research/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_mbt
Text Generation
•
7B
•
Updated
•
935
datasets
16
geodesic-research/fewshot-discourse-grounded-misalignment-evals
Updated
•
7
geodesic-research/discourse-grounded-synthetic-scenario-hhh-sft
Viewer
•
Updated
•
26.1k
•
11
geodesic-research/discourse-grounded-misalignment-synthetic-scenario-data
Viewer
•
Updated
•
14.9M
•
105
geodesic-research/sfm-mcqa-sft-mix
Viewer
•
Updated
•
973k
•
17
geodesic-research/discourse-grounded-misalignment-evals
Viewer
•
Updated
•
4.17k
•
296
geodesic-research/sfm-sft-multitask-benign-tampering-mix
Viewer
•
Updated
•
1.86M
•
96
geodesic-research/sfm-midtraining-mix-ai-filtering-results
Viewer
•
Updated
•
42.8M
•
12
geodesic-research/sfm-pretraining-mix-ai-filtering-results
Viewer
•
Updated
•
406M
•
46
geodesic-research/Dolci-Instruct-SFT-Python-Correct
Viewer
•
Updated
•
885k
•
26
geodesic-research/alignment-tampering-sft-mix
Viewer
•
Updated
•
20k
•
20