Roadmap#
FurisaAI strives to deliver the releases for each month, while offering patch releases. This page shows the forward-looking roadmap of ongoing & upcoming projects and when they are expected to land, broken down by areas on our software stack.
Latest Recent Release#
The latest release is 2024.2.1 (beta 0) on Jan 10, 2025. You can find the release notes here.
Future Releases#
2025 Q1#
- 🔲 Tensor Parallelism support Phase 2: Inter-chip (planned for 2025.1.0 release) 
- 🔲 Speculating with a draft model (planned for 2025.1.0 release) 
- 🔲 CPU memory swapping of KV cache in Furiosa LLM (planned for 2025.2.0 release) 
- 🔲 - torch.compile()backend (planned for 2025.2.0 release)
- 🔲 Embedding API support in Furiosa LLM (planned for 2025.1.0 release) 
- 🔲 Tool-calling support in Furiosa LLM (planned for 2025.1.0 release) 
- 🔲 Chunked Prefill support in Furiosa LLM (planned for 2025.2.0 release) 
2024 Q4#
- ✅ Language Model Support: CodeLLaMA2, Vicuna, Solar, EXAONE-3.0 (2024.2.0 release) 
- ✅ Vision Model Support: MobileNetV1, MobileNetV2, ResNet152, ResNet50, EfficientNet, YOLOv8m, .. (2024.2.0 release) 
- ✅ Tensor Parallelism support Phase 1: Intra-chip (2024.2.0 release) 
- ✅ Torch 2.4.1 support (2024.2.0) 
- ✅ Huggingface Optimum integration (2024.2.0 release) 
- 🔲 Device remapping support (e.g., /dev/rngd/npu2pe0-3 -> /dev/rngd/npu0pe0-3) for container (planned 2024.2.2 release) 
- 🔲 CPU memory swapping of KV cache in Furiosa LLM (postponed to 2025 Q1) 
- 🔲 Speculating with a draft model (postponed to 2025 Q1) 
- 🔲 - torch.compile()backend (postponed to 2025 Q1)
