FuriosaAI Developer Center#
Welcome to the FuriosaAI Developer Center. FuriosaAI provides a streamlined software stack for deep learning model inference on FuriosaAI NPUs. This document guides you through the entire workflow of writing inference applications, from a PyTorch model through model quantization, serving, and production deployment.
Warning
This document is based on Furiosa SDK 2024.1.0 (alpha), and the features and APIs described here may change in the future.
2024.1.0 is the first SDK release for RNGD. This document provides an overview of the new features and changes in the latest release.
Furiosa LLM is a high-performance inference engine for large language models (LLMs). This document explains how to install and use Furiosa LLM; a minimal usage sketch follows below.
This document describes how to reproduce the MLPerf™ Inference Benchmark using the FuriosaAI Software Stack.
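As a first orientation, the snippet below is a minimal sketch of offline text generation with Furiosa LLM. It assumes the furiosa-llm Python package exposes a vLLM-style interface (`LLM`, `SamplingParams`, `generate`) and that a pre-built model artifact is available locally; the names, paths, and parameters shown are illustrative rather than a definitive API reference.

```python
# Minimal sketch of offline generation with Furiosa LLM.
# Assumptions: furiosa-llm provides a vLLM-style interface (LLM,
# SamplingParams, generate) and a compiled model artifact exists at the
# path below; adjust both to the SDK version you have installed.
from furiosa_llm import LLM, SamplingParams

# Load a model artifact compiled for the NPU (path is illustrative).
llm = LLM.load_artifact("./Llama-3.1-8B-Instruct")

# Sampling settings for generation.
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=128)

# Run batched text generation and print the completions.
prompts = ["What is the capital of France?"]
for output in llm.generate(prompts, sampling_params):
    print(output.outputs[0].text)
```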
Overview#
FuriosaAI RNGD: RNGD hardware specifications and features
FuriosaAI’s Software Stack: An overview of the FuriosaAI software stack
Supported Models: A list of supported models
What’s New: New features and changes in the latest release
Roadmap: The future roadmap of the FuriosaAI Software Stack
Getting Started#
Installing Prerequisites: How to install the prerequisites for the FuriosaAI Software Stack
Cloud Native Toolkit#
Cloud Native Toolkit: An overview of the Cloud Native Toolkit
Kubernetes Support: An overview of Kubernetes support
Device Management#
furiosa-smi: A command-line utility for managing FuriosaAI NPUs
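For a quick device check from an application, the hedged sketch below shells out to the furiosa-smi tool. The `info` subcommand is assumed here to print the installed NPUs; substitute whichever subcommand your SDK version documents.

```python
import subprocess

# Hedged sketch: query installed FuriosaAI NPUs by invoking the
# furiosa-smi CLI. The "info" subcommand is an assumption; replace it
# with the subcommand documented for your SDK version if it differs.
result = subprocess.run(
    ["furiosa-smi", "info"],
    capture_output=True,
    text=True,
    check=True,
)
print(result.stdout)
```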