
LLaMA-MoE

https://github.com/pjlab-sys4nlp/llama-moe

Recent Activity

Xiaoye08 submitted a paper 1 day ago
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation
huxy912 authored a paper 3 months ago
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration
tongjingqi authored a paper 6 months ago
Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

Team members: Tong Zhu, Xuyang Hu, tongjingqi(SII), Xiaoye Qu, Jiacheng Ruan, Daize Dong

llama-moe's models (8)

llama-moe/LLaMA-MoE-v2-3_8B-residual-sft

8B • Updated Dec 3, 2024 • 10 • 2

llama-moe/LLaMA-MoE-v2-3_8B-2_8-sft

8B • Updated Dec 3, 2024 • 16 • 3

llama-moe/LLaMA-MoE-v1-3_0B-2_16

Text Generation • Updated Jun 25, 2024 • 46 • 11

llama-moe/LLaMA-MoE-v1-3_5B-4_16

Text Generation • Updated Jun 25, 2024 • 116 • 16

llama-moe/LLaMA-MoE-v1-3_0B-2_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 9 • 2

llama-moe/LLaMA-MoE-v1-3_5B-2_8-sft

Text Generation • 7B • Updated Jun 25, 2024 • 12 • 3

llama-moe/LLaMA-MoE-v1-3_5B-4_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 11 • 1

llama-moe/LLaMA-MoE-v1-3_5B-2_8

Text Generation • Updated Jun 25, 2024 • 400 • 15