Define what your agent knows, which tools it can use, and how it reasons. Pick from 19+ frontier LLMs, attach MCP servers for live data access, and deploy agents that stream answers in real time — no programming required.
Every agent gets its own model assignment. Use a lightweight model for simple FAQ bots and a frontier model for complex reasoning — optimizing cost and quality across your organization.
Anthropic's most capable model for complex analysis, long-form content generation, and nuanced reasoning. Ideal for agents that need to synthesize large documents, write detailed reports, or handle multi-step logic chains where accuracy is paramount.
The best balance of intelligence and speed in the Claude family. Sonnet handles most enterprise tasks — summarization, data extraction, customer support, code review — at a fraction of the cost of Opus, with response times under two seconds.
Anthropic's fastest and most affordable model. Deploy Haiku for high-volume, low-latency use cases like chat widgets, classification tasks, and simple Q&A agents where sub-second response time matters more than deep reasoning.
OpenAI's flagship model offering state-of-the-art performance across coding, math, and creative writing. Excels at tool use and structured output generation. A strong choice for agents that need to produce JSON, call APIs, or write production-quality code.
Optimized for instruction-following and long-context tasks with a 1M-token context window. GPT-4.1 is well-suited for agents that process lengthy documents, codebases, or conversation histories where retaining every detail matters.
OpenAI's reasoning-optimized models that think step by step before answering. Use them for agents that tackle math problems, multi-constraint planning, scientific analysis, or any task where chain-of-thought reasoning dramatically improves accuracy.
Plus GPT-4o, GPT-4o-mini, o3-mini, and more. New models are added within days of release — your agents always have access to the latest capabilities. Learn how to choose the right model →
Each agent in Orckai is defined by four building blocks: a system prompt that sets its personality and instructions, a model selection that determines its intelligence, tools that give it real-world capabilities, and an optional knowledge base for domain-specific answers.
System prompts support {{variable}} syntax for dynamic personalization.

Not every task requires the same level of reasoning. Orckai gives you two distinct runtime engines so you can match the execution mode to the complexity of the job — keeping things fast when they should be fast, and thorough when they need to be thorough.
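The {{variable}} templating mentioned above can be sketched as a simple substitution pass. A minimal illustration, assuming placeholders are word-character names in double braces (the renderer below is hypothetical, not Orckai's implementation):

```python
import re

def render_prompt(template: str, variables: dict[str, str]) -> str:
    """Replace each {{name}} placeholder with its value; unknown names stay intact."""
    return re.sub(
        r"\{\{(\w+)\}\}",
        lambda m: variables.get(m.group(1), m.group(0)),
        template,
    )

prompt = render_prompt(
    "You are the assistant for {{company_name}}. Greet {{user_name}} politely.",
    {"company_name": "Acme", "user_name": "Dana"},
)
```

Leaving unknown placeholders untouched (rather than failing) makes missing variables easy to spot in logs.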
An AI agent without tools is just a chatbot. Orckai lets you attach three categories of tools to any agent, turning it into an autonomous worker that can query databases, call APIs, search the web, send emails, and manipulate files — all governed by the permissions you define.
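The "governed by the permissions you define" part can be sketched as a gate in front of every tool call. The permission table and function names here are illustrative assumptions, not Orckai's API:

```python
# Hypothetical per-agent allowlist of tool names.
PERMISSIONS: dict[str, set[str]] = {
    "support-bot": {"web_search", "send_email"},
}

def can_invoke(agent: str, tool: str) -> bool:
    """Every tool call is checked against the agent's configured allowlist."""
    return tool in PERMISSIONS.get(agent, set())
```

An agent with no entry in the table can invoke nothing — deny by default.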
Watch your agent think in real time. Every response streams token by token via Server-Sent Events. Every conversation is logged. Every token is counted and costed.
Agent responses are delivered via Server-Sent Events (SSE) — the same protocol used by ChatGPT and Claude. Users see tokens appear in real time instead of waiting for a complete response. Streaming works across the web UI, embedded widgets, and the public API, giving every interface a responsive, conversational feel.
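On the wire, SSE is a line-oriented text format: each event carries one or more `data:` lines and is terminated by a blank line. The parser below is a minimal sketch of that format as defined by the SSE standard — the token payloads are invented examples, not Orckai's actual event schema:

```python
def parse_sse(stream: str) -> list[str]:
    """Collect the data payload of each event; a blank line terminates an event."""
    events: list[str] = []
    buffer: list[str] = []
    for line in stream.splitlines():
        if line.startswith("data:"):
            buffer.append(line[len("data:"):].lstrip())
        elif line == "" and buffer:   # end of one event
            events.append("\n".join(buffer))
            buffer = []
    return events

# Three events, each streaming a fragment of the answer.
raw = "data: Hel\n\ndata: lo, wor\n\ndata: ld!\n\n"
tokens = parse_sse(raw)
answer = "".join(tokens)   # client concatenates fragments as they arrive
```

A real client would read the stream incrementally and render each fragment as it lands, which is what produces the token-by-token effect.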
Every agent conversation is stored and resumable. Users can pick up where they left off, and agents maintain full context from prior turns. Conversation history is organization-scoped with row-level isolation, so multi-tenant deployments keep each team's data completely separate. Browse, search, and audit past conversations from the admin panel.
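Row-level isolation boils down to one invariant: every read is filtered by the caller's organization. A toy in-memory sketch (the data and function are hypothetical; a real deployment would enforce this at the database layer):

```python
# Toy conversation store; "org" is the tenant key on every row.
conversations = [
    {"id": 1, "org": "acme",   "title": "Refund flow"},
    {"id": 2, "org": "globex", "title": "Onboarding questions"},
    {"id": 3, "org": "acme",   "title": "API outage triage"},
]

def list_conversations(org: str) -> list[dict]:
    """Reads are always scoped to one organization; other tenants' rows never leak."""
    return [c for c in conversations if c["org"] == org]
```

Resuming a conversation then means loading its prior turns through the same scoped path.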
Orckai tracks input tokens, output tokens, and estimated cost for every agent execution. View per-agent, per-user, and per-organization breakdowns. Set up alerts when usage exceeds thresholds. Detailed metrics help you right-size model selections — switch an over-provisioned agent from Opus to Sonnet and see the cost impact immediately.
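Cost estimation from token counts is simple arithmetic over per-million-token rates. The rates below are hypothetical placeholders (real prices vary by provider and change over time), but the right-sizing comparison works the same way:

```python
# Hypothetical USD rates per million tokens; not actual provider pricing.
RATES = {
    "opus":   {"input": 15.00, "output": 75.00},
    "sonnet": {"input": 3.00,  "output": 15.00},
}

def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost = input tokens at the input rate + output tokens at the output rate."""
    rate = RATES[model]
    return (input_tokens * rate["input"] + output_tokens * rate["output"]) / 1_000_000

# The same 10k-in / 2k-out workload priced on each model.
opus_cost = estimate_cost("opus", 10_000, 2_000)
sonnet_cost = estimate_cost("sonnet", 10_000, 2_000)
```

Under these placeholder rates the switch cuts the per-request cost fivefold, which is the kind of delta the usage dashboard surfaces.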