FasterTransformer
inference_framework
Overview
Developed byNVIDIA
LicenseApache License 2.0
Open source✓ Open Source
Use caseAccelerating Transformer model inference
Integrates with
Knowledge graph stats
Claims31
Avg confidence92%
Avg freshness99%
Last updatedUpdated 5 days ago
Trust distribution
100% unverified
Governance
Not assessed
FasterTransformer
product
NVIDIA's CUDA-based library for accelerated transformer model inference with various optimization techniques.
Compare with...primary use case
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Accelerating Transformer model inference | ○Unverified | High | Fresh | 1 |
| Transformer model inference optimization | ○Unverified | High | Fresh | 1 |
| High-performance transformer model inference acceleration | ○Unverified | High | Fresh | 1 |
| GPU-accelerated transformer model inference | ○Unverified | High | Fresh | 1 |
| High-performance transformer model inference optimization | ○Unverified | High | Fresh | 1 |
| accelerated transformer model inference | ○Unverified | High | Fresh | 1 |
pricing model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| free | ○Unverified | High | Fresh | 1 |
open source
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| true | ○Unverified | High | Fresh | 1 |
developed by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| NVIDIA | ○Unverified | High | Fresh | 1 |
maintained by
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| NVIDIA | ○Unverified | High | Fresh | 1 |
integrates with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| CUDA | ○Unverified | High | Fresh | 1 |
| Triton Inference Server | ○Unverified | High | Fresh | 1 |
| PyTorch | ○Unverified | High | Fresh | 1 |
| TensorRT | ○Unverified | Moderate | Fresh | 1 |
| TensorFlow | ○Unverified | Moderate | Fresh | 1 |
written in
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| C++ | ○Unverified | High | Fresh | 1 |
| CUDA | ○Unverified | High | Fresh | 1 |
supports model
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| BERT | ○Unverified | High | Fresh | 1 |
| GPT | ○Unverified | High | Fresh | 1 |
| T5 | ○Unverified | High | Fresh | 1 |
based on
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| C++/CUDA | ○Unverified | High | Fresh | 1 |
| C++ | ○Unverified | High | Fresh | 1 |
| CUDA kernels | ○Unverified | Moderate | Fresh | 1 |
requires
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| NVIDIA GPU | ○Unverified | High | Fresh | 1 |
| CUDA | ○Unverified | High | Fresh | 1 |
license type
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Apache License 2.0 | ○Unverified | High | Fresh | 1 |
competes with
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| TensorRT | ○Unverified | Moderate | Fresh | 1 |
| Hugging Face Transformers | ○Unverified | Moderate | Fresh | 1 |
alternative to
| Value | Trust | Confidence | Freshness | Sources |
|---|---|---|---|---|
| Hugging Face Transformers | ○Unverified | Moderate | Fresh | 1 |