Supported Models#
FuriosaAI Software Stack supports a variety of Transformer-based models in HuggingFace Hub. The following is the list of model architectures that are currently supported by Furiosa SDK. If your model is based on the following architectures, you can use Furiosa SDK to compile, quantize, and run the model on FuriosaAI RNGD.
Decoder-only Models#
Model Name |
Architecture |
Example HuggingFace Models |
---|---|---|
Llama 2, Llama 3.1 |
|
|
GPT-J |
|
|
Solar |
|
|
EXAONE |
|
|
CodeLlama |
|
|
Encoder-only Models#
Model Name |
Architecture |
Example HuggingFace Models |
---|---|---|
Bert |
|
|
Planned Models for Future Releases#
Falcon
Mistral-7B-v0.1, Mistral-7B-Instruct-v0.1
Mixtral-8x7B-v0.1, Mixtral-8x7B-Instruct-v0.1
Phi-3, Phi-3.5,
Phi-3.5-MoE
Qwen2
gemma-2
DeepSeek-Coder-V2-Instruct
Stable Diffusion XL
Llama 3.2 multi-model models
Llama 3.3