Supported Models
FuriosaAI’s software stack supports a wide range of Transformer-based models available on the Hugging Face Hub. Below is a list of model architectures currently supported by the Furiosa SDK. If your model is based on any of these architectures, you can leverage the Furiosa SDK to compile, quantize, and run the model efficiently on Furiosa’s NPUs.
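One way to tell which architecture a checkpoint is based on is to read the `architectures` field of its Hugging Face `config.json`. The sketch below does this with the standard library only. The class names in `ASSUMED_SUPPORTED_ARCHITECTURES` are assumptions inferred from the model families listed on this page, not names taken from the Furiosa SDK, and the helper functions are hypothetical.

```python
import json

# Assumed Hugging Face class names, inferred from the model families listed
# on this page. They are NOT taken from the Furiosa SDK; adjust as needed.
ASSUMED_SUPPORTED_ARCHITECTURES = {
    "LlamaForCausalLM",
    "GPTJForCausalLM",
    "ExaoneForCausalLM",
    "BertForQuestionAnswering",
}


def declared_architectures(config_text: str) -> list[str]:
    """Return the architecture names declared in a config.json document."""
    config = json.loads(config_text)
    # The field may be missing or null in some configs; treat both as empty.
    return list(config.get("architectures") or [])


def is_supported(config_text: str, supported=ASSUMED_SUPPORTED_ARCHITECTURES) -> bool:
    """Check whether any declared architecture is in the supported set."""
    return any(arch in supported for arch in declared_architectures(config_text))


if __name__ == "__main__":
    # Minimal stand-in for the contents of a model's config.json.
    example = '{"architectures": ["LlamaForCausalLM"], "model_type": "llama"}'
    print(declared_architectures(example), is_supported(example))
```

A match here only indicates that the checkpoint declares one of the assumed class names; whether it compiles and runs still depends on the SDK release in use.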
Decoder-only Models
Model Name | Architecture | Example Hugging Face Models |
---|---|---|
Llama 2, Llama 3.1 | | |
GPT-J | | |
Solar | | |
EXAONE | | |
CodeLlama | | |
Encoder-only Models
Model Name | Architecture | Example Hugging Face Models |
---|---|---|
BERT | | |
Planned Models for Future Releases
Falcon
Mistral-7B-v0.1, Mistral-7B-Instruct-v0.1
Mixtral-8x7B-v0.1, Mixtral-8x7B-Instruct-v0.1
Phi-3, Phi-3.5, Phi-3.5-MoE
Qwen2
Gemma 2
DeepSeek-Coder-V2-Instruct
Stable Diffusion XL
Llama 3.2 multimodal models
Llama 3.3