Skip to content

Streaming

Stream tokens from a chat completion as they arrive

Stream LLM responses token by token with a unified async interface that works across Claude, OpenAI, and Ollama without changing your code.

Community — runs on the OSS / Community SDK edition.

Difficulty: Intermediate 🟦 · LLM · Streaming

  • Summary: Streaming chat completion with real-time output
  • Scenario: Stream tokens from a chat completion as they arrive
  • tech_tags in manifest: LLM, Streaming — example id streaming in conformance/examples_manifest.json.
  • SDK: Use an installed SDK tree (NXUSKIT_SDK_DIR, NXUSKIT_LIB_PATH as needed); test-examples.sh resolves Go/Rust/Python deps from that tree only — see README.md, scripts/setup-sdk.sh, and scripts/test-examples.sh.
  • Languages in this example: go, python, rust (paths under this directory; Python may live under a sibling python/ or shared reference per Language Implementations).
  • Models: Set cloud provider API keys and/or run Ollama locally when you execute the Run steps (interactive flags like --help / --verbose are documented below).

Real-time chat interface, live transcription display

LLM, Streaming

LanguagePathStatus
Rustrust/Available
Gogo/Available
Pythonpython/Available

Attach an installed SDK (NXUSKIT_SDK_DIR). See the repository README.md and scripts/test-examples.sh.

Terminal window
# From `/examples/patterns/streaming`:
cd rust && cargo build
cd go && make build
cd python && python3 main.py --help
Terminal window
cd rust
cargo run
Terminal window
cd go
make build && bin/streaming
Terminal window
cd python
python main.py