Skip to content

Add support to ArcticForCausalLMΒ #6877

Closed
@maziyarpanahi

Description

@maziyarpanahi

First open LLM from @SnowflakeDB! Arctic is 480B Dense-MoE with a 10B dense transformer model and a 128x3.66B MoE MLP designed specifically for enterprise AI. πŸ€”

TL;DR:
🧠 480B parameters with 17B active during generation
πŸ‘¨β€πŸ« 128 experts with 2 active in generation
2️⃣ Instruct & Base versions released
πŸ™οΈ Focused on Enterprise task (Code, SQL, Reasoning, Following)
πŸ”“ Released under Apache 2.0
πŸ—» in fp16 ~900GB Memory & in int4 ~240GB
πŸ€— Available on @huggingface

πŸ‹πŸ» Trained with DeepSpeed-MoE

Blog: https://snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/

Models: https://huggingface.co/Snowflake/snowflake-arctic-instruct

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions