Skip to main content
DeepSpeed Inference
productinference_framework
Try in PlaygroundRSS
Overview
Developed byMicrosoft
Founded2020
LicenseApache License 2.0
Open source✓ Open Source
Use casehigh-performance inference for large language models
Integrates with
Also see
Alternative to
Knowledge graph stats
Claims14
Avg confidence91%
Avg freshness100%
Last updatedUpdated 2 days ago
Trust distribution
100% unverified
Governance
EU Risknot classified

DeepSpeed Inference

product

Microsoft's optimized inference engine for large-scale language models with memory and speed optimizations

Compare with...

pricing model

ValueTrustConfidenceFreshnessSources
freeUnverifiedHighFresh1

integrates with

ValueTrustConfidenceFreshnessSources
PyTorchUnverifiedHighFresh1
TransformersUnverifiedHighFresh1

open source

ValueTrustConfidenceFreshnessSources
trueUnverifiedHighFresh1

developed by

ValueTrustConfidenceFreshnessSources
MicrosoftUnverifiedHighFresh1

maintained by

ValueTrustConfidenceFreshnessSources
MicrosoftUnverifiedHighFresh1

requires

ValueTrustConfidenceFreshnessSources
PythonUnverifiedHighFresh1

license type

ValueTrustConfidenceFreshnessSources
Apache License 2.0UnverifiedHighFresh1

primary use case

ValueTrustConfidenceFreshnessSources
high-performance inference for large language modelsUnverifiedHighFresh1

supports model

ValueTrustConfidenceFreshnessSources
GPT modelsUnverifiedHighFresh1
BERT modelsUnverifiedModerateFresh1

founded year

ValueTrustConfidenceFreshnessSources
2020UnverifiedModerateFresh1

alternative to

ValueTrustConfidenceFreshnessSources
TensorRTUnverifiedModerateFresh1
ONNX RuntimeUnverifiedModerateFresh1

Alternatives & Similar Tools

alternative to
Compare
alternative to
Compare

Commonly Used With

Related entities

Graph Insights

Top sources (14 claims traced)
founded_yearhighsource
pricing_modelhighsource
alternative_tohighsource
alternative_tohighsource
supports_modelhighsource
Trace all provenance
Claim count: 14Last updated: 4/28/2026Edit history