Get started with AI Inference

August 19, 2025

Resource type: E-book

This e-book introduces the fundamentals of inference performance engineering and model optimization, with a focus on quantization, sparsity, and other techniques that reduce the compute and memory requirements of artificial intelligence (AI) models. It highlights the benefits of a Red Hat® open approach, a validated model repository, and tools such as LLM Compressor and Red Hat AI Inference Server. Download the e-book to get started.

Download

About Red Hat

Red Hat is an open hybrid cloud technology leader, delivering a consistent, comprehensive foundation for transformative IT and artificial intelligence (AI) applications in the enterprise. As a trusted adviser to the Fortune 500, Red Hat offers cloud, developer, Linux, automation, and application platform technologies, as well as award-winning services.
