636: Red Hat's James Huang

Af
- The Mad Botter
Afsnit
- 584
Udgivet
- 19. dec. 2025
Forlag
- The Mad Botter

0 Anmeldelser: 0
Afsnit: 584 of 602
Længde: 20M
Sprog: Engelsk
Format
Kategori: Økonomi & Business

Links

James on LinkedIn

Mike on LinkedIn

Mike's Blog

Show on Discord

Alice Promo

1. AI on Red Hat Enterprise Linux (RHEL)

Trust and Stability: RHEL provides the mission-critical foundation needed for workloads where security and reliability cannot be compromised.

Predictive vs. Generative: Acknowledging the hype of GenAI while maintaining support for traditional machine learning algorithms.

Determinism: The challenge of bringing consistency and security to emerging AI technologies in production environments.

1. Rama-Llama & Containerization

Developer Simplicity: Rama-Llama helps developers run local LLMs easily without being "locked in" to specific engines; it supports Podman, Docker, and various inference engines like Llama.cpp and Whisper.cpp.

Production Path: The tool is designed to "fade away" after helping package the model and stack into a container that can be deployed directly to Kubernetes.

Behind the Firewall: Addressing the needs of industries (like aircraft maintenance) that require AI to stay strictly on-premises.

1. Enterprise AI Infrastructure

Red Hat AI: A commercial product offering tools for model customization, including pre-training, fine-tuning, and RAG (Retrieval-Augmented Generation).

Inference Engines: James highlights the difference between Llama.cpp (for smaller/edge hardware) and vLLM, which has become the enterprise standard for multi-GPU data center inferencing.

Forrige episode Næste episode

636: Red Hat's James Huang

Lyt når som helst, hvor som helst

Other podcasts you might like ...