FuriosaAI Developer Center#

Welcome to the FuriosaAI Developer Center. FuriosaAI provides an streamlined software stack for deep learning model inference on FuriosaAI NPUs. This document provides a guide to easily perform the entire workflow of writing inference applications, from starting with PyTorch model to model quantization, serving, and production deployment.

Warning

This document is based on Furiosa SDK 2024.2.1 (beta0) version, and the features and APIs described in this document may change in the future.

📢 Latest Release 2024.2.1

2024.2.1 is the latest SDK release for RNGD. This document provides an overview of the new features and changes in the latest release.

What’s New
🚀 Quick Start with Furiosa LLM

Furiosa LLM is a high-performance inference engine for LLM models. This document explains how to install and use Furiosa LLM.

Quick Start with Furiosa LLM
📊 Running MLPerf Benchmark

This document describes how to reproduce the MLPerf™ Inference Benchmark using the FuriosaAI Software Stack.

Running MLPerf™ Inference Benchmark

Overview#

Getting Started#

Furiosa LLM#

Cloud Native Toolkit#

Device Management#

Customer Support#