Vision
Send images alongside text prompts for multimodal analysis
Send images alongside text to any LLM provider using one consistent API that handles encoding, formatting, and provider differences for you.
Edition
Section titled “Edition”Community — runs on the OSS / Community SDK edition.
What this demonstrates
Section titled “What this demonstrates”Difficulty: Intermediate 🟦 · LLM · Vision
- Summary: Vision and multimodal capabilities with images
- Scenario: Send images alongside text prompts for multimodal analysis
tech_tagsin manifest:LLM, Vision— example idvisioninconformance/examples_manifest.json.
Prerequisites
Section titled “Prerequisites”- SDK: Use an installed SDK tree (
NXUSKIT_SDK_DIR,NXUSKIT_LIB_PATHas needed);test-examples.shresolves Go/Rust/Python deps from that tree only — see README.md,scripts/setup-sdk.sh, andscripts/test-examples.sh. - Languages in this example: go, python, rust, bash (paths under this directory; Python may live under a sibling
python/or shared reference per Language Implementations). - Models: Set cloud provider API keys for live Claude/OpenAI calls, or run metadata-only CLI/Bash mode with
VISION_RUN_LIVE=0. Ollama vision models can be selected when the local provider path is available.
Real-World Application
Section titled “Real-World Application”Image captioning, visual QA, document understanding
Technologies
Section titled “Technologies”LLM, Vision
Language Implementations
Section titled “Language Implementations”| Language | Path | Status |
|---|---|---|
| Rust | rust/ | Available |
| Go | go/ | Available |
| Python | python/ | Available |
| CLI/Bash | bash/ | Available |
Attach an installed SDK (NXUSKIT_SDK_DIR). See the repository README.md and scripts/test-examples.sh.
# From `/examples/patterns/vision`:cd rust && cargo buildcd go && make buildcd python && python3 main.py --helpcd bash && make buildcd rustcargo runcd gomake build && bin/visionPython
Section titled “Python”cd pythonpython main.pyCLI/Bash
Section titled “CLI/Bash”cd bashmake runVISION_RUN_LIVE=0 make runmake run ARGS="openai"Testing
Section titled “Testing”# Rustcd rust && cargo test
# Gocd go && go test -v
# Python smokecd python && python3 main.py --help
# CLI/Bashcd bash && make test