Model Providers¶
Locus targets OCI Generative AI as its primary provider, and OpenAI,
Anthropic, and Ollama as first-class alternatives. The same Agent
code works against any of them — only the model object changes.
Provider matrix (in order of priority):
| Provider | Model class | Notes |
|---|---|---|
| OCI Generative AI (default) | OCIChatCompletionsModel |
/openai/v1 transport — OpenAI, Meta, xAI, Mistral, Gemini, non-R Cohere |
| OCI Generative AI | OCIModel |
OCI SDK transport — required for Cohere R-series |
| OpenAI | OpenAIModel |
GPT-4o, o1, o3, gpt-5.x against the direct API |
| Anthropic | AnthropicModel |
Claude models |
| Ollama | OllamaModel |
Local LLMs (Llama, Mistral, Gemma) |
The registry helper get_model("provider:model_name") routes OCI ids
to the right transport automatically.
Run it (OCI Generative AI is the default; auto-detected from ~/.oci/config):
python examples/notebook_56_model_providers.py
Offline:
LOCUS_MODEL_PROVIDER=mock python examples/notebook_56_model_providers.py
Pin a specific OCI model:
LOCUS_MODEL_ID=meta.llama-3.3-70b-instruct python examples/notebook_56_model_providers.py
Source¶
# Copyright (c) 2025, 2026 Oracle and/or its affiliates.
# Licensed under the Universal Permissive License v1.0 as shown at
# https://oss.oracle.com/licenses/upl/
"""Notebook 51: Model providers — OCI Generative AI is the default; others plug in.
Locus targets OCI Generative AI as its primary provider, and OpenAI,
Anthropic, and Ollama as first-class alternatives. The same Agent code
works against any of them — only the model object changes.
Provider matrix (in order of priority):
OCI GenAI — OCIChatCompletionsModel /openai/v1 transport for OpenAI, Meta, xAI,
Mistral, Gemini, and non-R Cohere; OCIModel SDK transport for Cohere
R-series.
OpenAI — GPT-4o, o1, o3, gpt-5.x against the direct API.
Anthropic — Claude models.
Ollama — Local LLMs (Llama, Mistral, Gemma).
Registry — get_model("provider:model_name") routes OCI ids to the
right transport automatically.
Run it
# Default: OCI Generative AI auto-detected from ~/.oci/config
python examples/notebook_56_model_providers.py
# Offline / no credentials:
LOCUS_MODEL_PROVIDER=mock python examples/notebook_56_model_providers.py
# Pin a specific OCI model:
LOCUS_MODEL_ID=meta.llama-3.3-70b-instruct python examples/notebook_56_model_providers.py
"""
import asyncio
import time
from config import get_model as get_configured_model
from locus.agent import Agent
from locus.models.registry import get_model, list_providers
def _llm_call(
prompt: str, *, system: str = "Reply in one short sentence.", max_tokens: int = 80
) -> str:
"""One model call with a timing/token banner — used by every part."""
agent = Agent(model=get_configured_model(max_tokens=max_tokens), system_prompt=system)
t0 = time.perf_counter()
res = agent.run_sync(prompt)
dt = time.perf_counter() - t0
print(
f" [model call: {dt:.2f}s · {res.metrics.prompt_tokens}→{res.metrics.completion_tokens} tokens]"
)
return res.message.strip()
# Part 1: list every provider the registry knows about.
def example_providers():
"""List available model providers."""
print("=== Available providers ===\n")
providers = list_providers()
print(f"Registered providers: {providers}")
print(
f"AI rationale: {_llm_call('In one sentence, why do AI SDKs ship a model registry instead of hard-coding one provider?')}"
)
print("\nUsage (OCI first):")
print(
' model = get_model("oci:openai.gpt-4.1", profile="DEFAULT") # → OCIChatCompletionsModel'
)
print(
' model = get_model("oci:meta.llama-3.3-70b-instruct") # → OCIChatCompletionsModel'
)
print(
' model = get_model("oci:cohere.command-r-plus", '
'profile_name="DEFAULT", auth_type="api_key") # → OCIModel'
)
print(' model = get_model("openai:gpt-4o")')
print(' model = get_model("anthropic:claude-sonnet-4-20250514")')
print(' model = get_model("ollama:llama3.3")')
print()
print("The 'oci:' prefix auto-routes by model family — 'cohere.command-r-*'")
print("uses OCIModel (SDK transport), everything else uses OCIChatCompletionsModel")
print("(/openai/v1). See docs/how-to/oci-models.md.")
# Part 2: instantiate each provider directly, without the registry.
def example_direct():
"""Use providers directly without the registry."""
print("\n=== Direct provider usage ===\n")
print(
f"AI rationale: {_llm_call('In one sentence, when would you instantiate OCIChatCompletionsModel directly instead of via the registry?')}"
)
# OCI v1 transport — the path used by every OpenAI / Meta / xAI /
# Mistral / Gemini model on OCI GenAI.
print("OCI GenAI — V1 (/openai/v1):")
print(" from locus.models import OCIChatCompletionsModel")
print(' model = OCIChatCompletionsModel(model="openai.gpt-4.1", profile="DEFAULT")')
print()
print(" # Workload identity on OCI VM / OKE / Functions:")
print(" model = OCIChatCompletionsModel(")
print(' model="openai.gpt-4.1",')
print(' auth_type="instance_principal", # or "resource_principal"')
print(' compartment_id="ocid1.compartment.oc1...",')
print(" )")
# OCI SDK transport — only Cohere R-series needs this.
print("\nOCI GenAI — SDK (/20231130/actions/v1, Cohere R-series only):")
print(" from locus.models import OCIModel")
print(" model = OCIModel(")
print(' model_id="cohere.command-r-plus-08-2024",')
print(' profile_name="DEFAULT",')
print(' auth_type="api_key",')
print(" )")
print("\nOpenAI (direct API, requires OPENAI_API_KEY):")
print(" from locus.models import OpenAIModel")
print(' model = OpenAIModel(model="gpt-4o")')
print("\nAnthropic (requires ANTHROPIC_API_KEY):")
print(" from locus.models.native.anthropic import AnthropicModel")
print(' model = AnthropicModel(model="claude-sonnet-4-20250514")')
print("\nOllama (requires local Ollama server):")
print(" from locus.models.native.ollama import OllamaModel")
print(' model = OllamaModel(model="llama3.3")')
async def example_live_call() -> None:
"""Call whichever provider is configured in the environment."""
print("\n=== Live provider call ===\n")
model = get_configured_model(max_tokens=80)
agent = Agent(
model=model,
system_prompt="Reply with one short sentence.",
)
import time as _t
t0 = _t.perf_counter()
result = agent.run_sync("Name two strengths of OCI Generative AI.")
dt = _t.perf_counter() - t0
print(f" Model class: {type(model).__name__}")
print(f" Reply: {result.message.strip()}")
print(
f" [model call: {dt:.2f}s · {result.metrics.prompt_tokens}→{result.metrics.completion_tokens} tokens]"
)
if __name__ == "__main__":
example_providers()
example_direct()
asyncio.run(example_live_call())