FuriosaAI Developer Center#

Welcome to the FuriosaAI Developer Center! FuriosaAI offers a streamlined software stack designed for deep learning model inference on FuriosaAI NPUs. This guide covers the entire workflow for creating inference applications, starting from a PyTorch model, through model quantization, and model serving and deployment.

Warning

This document is based on the Furiosa SDK 2024.2.1 (beta0) version. The features and APIs described herein are subject to change in the future.

📢 Latest Release 2024.2.1

2024.2.1 is the latest SDK release for RNGD. This document provides an overview of the new features and changes in the latest release.

What’s New
🚀 Quick Start with Furiosa LLM

Furiosa LLM is a high-performance inference engine for LLM models. This document explains how to install and use Furiosa LLM.

Quick Start with Furiosa LLM
📊 Running MLPerf Benchmark

This document describes how to reproduce the MLPerf™ Inference Benchmark using the FuriosaAI Software Stack.

Running MLPerf™ Inference Benchmark

Overview#

Getting Started#

Furiosa LLM#

Cloud Native Toolkit#

Device Management#

Customer Support#