January 17, 2024

Running Large Language Models in Production: An Adventure into the Frontier

By Leah Brown

In his opening remarks at the 2023 DevOps Enterprise Summit, John Rauser, Director of Engineering at Cisco Systems, offered an apt analogy. Building production systems for large language models (LLMs), he explained, is akin to “venturing into an area of the world where we don’t have a map yet.” It’s an exciting adventure into the frontier, with equal parts risk and reward.

Rauser anchored his talk around this adventurous theme. He emphasized that AI currently sits at the “peak of inflated expectations.” Yet while caution is warranted, real businesses like Rauser’s are already seeing material impacts from integrating LLMs into products and services. ChatGPT reaching an estimated 100 million users within about two months of launch illustrates how quickly this technology has been adopted and how much potential it holds.

Ops is the Biggest Barrier

A core theme of Rauser’s presentation was that AI operations (AIOps) is the most significant barrier to unlocking the value of LLMs. Developers can imagine endless creative applications powered by models like GPT-3, but those use cases mean little without the ability to build accurate, reliable systems around such models.

Rauser explained that within Cisco, LLM initiatives fall into three high-level categories:

Viziers: Helpful advisors who answer questions and provide expertise through conversational interfaces.
Judges: LLMs that summarize information and provide reasoned analysis to support decisions.
Generals: Autonomous LLMs that independently carry out critical business tasks.

Of these three use cases, the autonomous “General” poses the greatest AIOps challenge. Unlike those of Viziers and Judges, errant outputs from Generals can directly impact everything from revenue to regulatory compliance. Success requires exceptional accuracy and reliability at scale.

Castles, Councils, and Keeps

Rauser introduced the metaphor of building a strong castle to house the LLM “council” of viziers, judges, and generals. This castle consists of three critical elements: models, data, and interfaces.

Models: Production success starts with choosing (or building) the right LLM. Rauser surveyed key model innovations, from very large models like GPT-3 to compact open-source options like Llama 2. Picking the right model requires balancing accuracy, performance, and infrastructure constraints.
Data: No castle is secure without ample provisions to withstand a siege. Similarly, LLMs need rich, clean data and context to produce reliable and targeted outputs across diverse applications. Rauser suggested that rather than raw data, the emphasis should be on integrating meaningful “knowledge” through techniques like retrieval-augmented generation (RAG).
Interfaces: Finally, the castle gates control how users interact with the LLM council. Rauser stressed that interfaces must embed guardrails against harmful generated content. Because prompting strongly shapes outputs, interfaces should also prompt responsibly while allowing responses to be regenerated.
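
To make the data and interface elements concrete, here is a minimal Python sketch of how retrieved knowledge, a guardrail check, and regeneration might fit together. It is an illustration under stated assumptions, not Rauser’s or Cisco’s implementation: the knowledge base, the blocked-term list, the word-overlap retrieval, and every function name (retrieve, build_prompt, passes_guardrail, answer, make_flaky_model) are hypothetical, and the “model” is a stub standing in for a real LLM call.

```python
from typing import Callable, List

# Hypothetical in-memory knowledge base; a real system would likely
# query a search index or vector store instead.
KNOWLEDGE = [
    "Refunds are processed within 5 business days.",
    "Support is available Monday through Friday.",
    "Enterprise plans include a dedicated account manager.",
]

# Illustrative guardrail list only; not a real content policy.
BLOCKED_TERMS = {"password", "ssn"}


def retrieve(question: str, docs: List[str], k: int = 2) -> List[str]:
    """Rank documents by naive word overlap with the question."""
    q_words = set(question.lower().split())
    ranked = sorted(
        docs,
        key=lambda d: len(q_words & set(d.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question: str, context: List[str]) -> str:
    """Place the retrieved knowledge ahead of the user's question."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {question}"


def passes_guardrail(text: str) -> bool:
    """Reject any draft that mentions a blocked term."""
    lowered = text.lower()
    return not any(term in lowered for term in BLOCKED_TERMS)


def answer(question: str, model: Callable[[str], str], max_attempts: int = 3) -> str:
    """Regenerate until a draft passes the guardrail, else fall back."""
    prompt = build_prompt(question, retrieve(question, KNOWLEDGE))
    for _ in range(max_attempts):
        draft = model(prompt)
        if passes_guardrail(draft):
            return draft
    return "Sorry, I can't help with that request."


def make_flaky_model() -> Callable[[str], str]:
    """Stub model: the first draft violates the guardrail, later drafts do not."""
    calls = {"n": 0}

    def model(prompt: str) -> str:
        calls["n"] += 1
        if calls["n"] == 1:
            return "Your password is hunter2."
        return "Refunds are processed within 5 business days."

    return model


if __name__ == "__main__":
    print(answer("How long do refunds take?", make_flaky_model()))
```

In this sketch the first draft is rejected by the guardrail and the second is returned. The design choice worth noting is that the interface layer, not the model, owns both the context it supplies and the decision about whether an output is safe to show.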

Constructing the Moat

Unfortunately, warring factions threaten any newly constructed castle. Rauser argued that the greatest competitive moat for enterprise AI isn’t proprietary data or models. Rather, it lies in building a robust platform that lets development teams rapidly deploy innovative LLM-powered features.

Rauser concluded by calling for collaboration among industry leaders to map out this uncharted frontier. For now, AI pioneers must be content with using LLMs like ChatGPT to guide understanding. But persistent progress relies on an expanding community pushing the boundaries. The adventure continues…

To watch the full presentation, please visit the IT Revolution Video Library here: https://videos.itrevolution.com/watch/873538323

About the Author

Leah Brown

Managing Editor at IT Revolution working on publishing books and guidance papers for the modern business leader. I also oversee the production of the IT Revolution blog, combining the best of responsible, human-centered content with the assistance of AI tools.
