
Rad Fox

Nindaleth

AI & ML interests

None yet

Recent Activity

new activity 14 days ago

unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF:Opencode prompting leads to "conversation roles must alternate user and assistant roles except for tool calls and results"

new activity about 1 month ago

bartowski/kldzj_gpt-oss-120b-heretic-GGUF:Quantization of v2

new activity 3 months ago

mistralai/Devstral-Small-2507_gguf:Zed editor - some tool calls not working


Organizations

None yet

New activity in unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF 14 days ago

Opencode prompting leads to "conversation roles must alternate user and assistant roles except for tool calls and results"


#2 opened 18 days ago by

Nindaleth

New activity in bartowski/kldzj_gpt-oss-120b-heretic-GGUF about 1 month ago

Quantization of v2

#1 opened about 1 month ago by

kabachuha

New activity in mistralai/Devstral-Small-2507_gguf 3 months ago

Zed editor - some tool calls not working

#1 opened 6 months ago by

Nindaleth

New activity in MaziyarPanahi/Qwen2.5-Coder-0.5B-QwQ-draft-GGUF 10 months ago

Special tokens don't match between draft model and target model

#2 opened 12 months ago by

Nindaleth

New activity in bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF 11 months ago

Oobabooga Errors

#3 opened 11 months ago by

SekkSea

New activity in tugstugi/Qwen2.5-Coder-0.5B-QwQ-draft 12 months ago

Special tokens don't match between draft model and target model

#1 opened 12 months ago by

Nindaleth

upvoted a collection 12 months ago

Recommended small models

Collection

Every recent model smaller than ~25B parameters that is high quality and reputable • 19 items • Updated Nov 30, 2024 • 150

reacted to dvilasuero's post with ❤️ almost 2 years ago

Post

🚀🧙🏼‍♂️Introducing OpenHermesPreferences: the largest open AI feedback dataset for RLHF & DPO

> Using LLMs to improve other LLMs, at scale!

Built in collaboration with the H4 Hugging Face team, it's a dataset of 1M preferences built on top of @teknium's amazing dataset.

Dataset:
argilla/OpenHermesPreferences

The dataset is another example of open collaboration:

> The H4 team created responses with Mixtral using llm-swarm

> Argilla created responses with NousResearch Hermes-2-Yi-34B using distilabel

> The H4 team ranked these responses plus the original response with PairRM from AllenAI, University of Southern California, and Zhejiang University (@yuchenlin, @DongfuTingle and colleagues)
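
To make the ranking step concrete, here is a minimal sketch of how several candidate responses to one prompt could be turned into a single preference record. Everything in it is an assumption for illustration: `build_preference_pair` and `toy_score` are hypothetical names, the `chosen`/`rejected` field names follow a common DPO convention rather than anything stated in this post, and the scalar scorer is only a stand-in (PairRM compares responses pairwise and is not called here).

```python
from typing import Callable, Dict, List


def build_preference_pair(
    prompt: str,
    candidates: List[str],
    score_fn: Callable[[str, str], float],
) -> Dict[str, str]:
    """Rank candidates by score; return the best as 'chosen' and the worst as 'rejected'."""
    if len(candidates) < 2:
        raise ValueError("need at least two candidate responses to form a preference")
    # Highest score first; sorted() is stable, so ties keep their input order.
    ranked = sorted(candidates, key=lambda c: score_fn(prompt, c), reverse=True)
    return {"prompt": prompt, "chosen": ranked[0], "rejected": ranked[-1]}


def toy_score(prompt: str, response: str) -> float:
    # Hypothetical stand-in for a real ranker: simply prefers longer responses.
    return float(len(response))


if __name__ == "__main__":
    pair = build_preference_pair(
        "What is 2 + 2?",
        ["4", "2 + 2 equals 4.", "It is 4"],
        toy_score,
    )
    print(pair)
```

In a real pipeline the scorer would be replaced by a learned ranker, and the three candidates would be the two generated responses plus the original one.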

We hope this dataset will help the community's research efforts towards understanding the role of AI feedback for LLM alignment.

We're particularly excited about the ability to filter specific subsets to improve LLM skills like math or reasoning.

Here's how easy it is to filter by subset:

from datasets import load_dataset

ds = load_dataset("HuggingFaceH4/OpenHermesPreferences", split="train")

# Get the categories of the source dataset
# ['airoboros2.2', 'CamelAI', 'caseus_custom', ...]
sources = ds.unique("source")

# Filter for a subset (num_proc=6 runs the filter across six worker processes)
ds_filtered = ds.filter(lambda x: x["source"] in ["metamath", "EvolInstruct_70k"], num_proc=6)

As usual, all the scripts to reproduce this work are available and open to the community!

argilla/OpenHermesPreferences

Such a fun collab between @vwxyzjn, @plaguss, @kashif, @philschmid & @lewtun!

Open Source AI FTW!