Supported Models#

FuriosaAI Software Stack supports a variety of Transformer-based models in HuggingFace Hub. The following is the list of model architectures that are currently supported by Furiosa SDK. If your model is based on the following architectures, you can use Furiosa SDK to compile, quantize, and run the model on FuriosaAI RNGD.

Decoder-only Models#

Decoder-only Models#

Architecture

Model Name

Example HuggingFace Models

LlamaForCausalLM

Llama 2, Llama 3.1

meta-llama/Llama-2-70b-hf, meta-llama/Llama-3.1-70B, meta-llama/Llama-3.1-70B-Instruct, meta-llama/Llama-3.1-8B, meta-llama/Llama-3.1-8B-Instruct, ..

GPTJForCausalLM

GPT-J

EleutherAI/gpt-j-6b

Encoder-only Models#

Encoder-only Models#

Architecture

Model Name

Example HuggingFace Models

BertForQuestionAnswering

Bert

google-bert/bert-large-uncased, google-bert/bert-base-uncased, ..

Planned Models for Future Releases#

  • 2024.2 (October 31, 2024)
    • Language Models
      • SOLAR-10.7B-v1.0, SOLAR-10.7B-Instruct-v1.0

      • EXAONE-3.0-7.8B-Instruct

      • vicuna-7b-v1.5

      • CodeLlama-7b-Instruct-hf

      • RoBERTa-base, RoBERTa-large

    • Vision Models
      • MobileNetV1, MobileNetV2

      • YOLOv8m

  • TDB:
    • Falcon

    • Mistral-7B-v0.1, Mistral-7B-Instruct-v0.1

    • Mixtral-8x7B-v0.1, Mixtral-8x7B-Instruct-v0.1

    • Phi-3, Phi-3.5,

    • Phi-3.5-MoE

    • Qwen2

    • gemma-2

    • DeepSeek-Coder-V2-Instruct

    • Stable Diffusion XL

    • Llama 3.2