Streaming
Stream tokens from a chat completion as they arrive
Stream LLM responses token by token with a unified async interface that works across Claude, OpenAI, and Ollama without changing your code.
Edition
Section titled “Edition”Community — runs on the OSS / Community SDK edition.
What this demonstrates
Section titled “What this demonstrates”Difficulty: Intermediate 🟦 · LLM · Streaming
- Summary: Streaming chat completion with real-time output
- Scenario: Stream tokens from a chat completion as they arrive
tech_tagsin manifest:LLM, Streaming— example idstreaminginconformance/examples_manifest.json.
Prerequisites
Section titled “Prerequisites”- SDK: Use an installed SDK tree (
NXUSKIT_SDK_DIR,NXUSKIT_LIB_PATHas needed);test-examples.shresolves Go/Rust/Python deps from that tree only — see README.md,scripts/setup-sdk.sh, andscripts/test-examples.sh. - Languages in this example: go, python, rust (paths under this directory; Python may live under a sibling
python/or shared reference per Language Implementations). - Models: Set cloud provider API keys and/or run Ollama locally when you execute the Run steps (interactive flags like
--help/--verboseare documented below).
Real-World Application
Section titled “Real-World Application”Real-time chat interface, live transcription display
Technologies
Section titled “Technologies”LLM, Streaming
Language Implementations
Section titled “Language Implementations”| Language | Path | Status |
|---|---|---|
| Rust | rust/ | Available |
| Go | go/ | Available |
| Python | python/ | Available |
Attach an installed SDK (NXUSKIT_SDK_DIR). See the repository README.md and scripts/test-examples.sh.
# From `/examples/patterns/streaming`:cd rust && cargo buildcd go && make buildcd python && python3 main.py --helpcd rustcargo runcd gomake build && bin/streamingPython
Section titled “Python”cd pythonpython main.py