Explore the key innovations in this roadmap, which highlight upcoming improvements and advancements.
This section lists features and improvements still in development, with no confirmed release date.
- PyGEAI v0.7.0
- The pygeai.proxy module will be deprecated and moved to a standalone package. You must migrate to the new pygeai-proxy package.The proxy functionality for managing MCP and A2A servers has been extracted from the main pygeai package into a dedicated pygeai-proxy package. This change provides better separation of concerns and allows the proxy functionality to evolve independently.
- API
- New /videos/generations endpoint so you can generate videos via Vertex AI’s Veo 2, with proxy handling for long-running operations and MP4 decoding.
- New token management API so you can create, revoke, and update project-level API tokens using secure, role-validated endpoints (project tokens cannot manage tokens).
- New secure endpoints to list Organizations and Projects by user email that use OAuth and restrict access to ProvisioningServices or GAM Administrator roles, so you manage roles effectively across platforms.
- Improved portability via CLI and Python SDK export that lets you programmatically export Agents, Tools, and Agentic Processes for backup and reuse.
- Improved observability for the Data Analyst Assistant that forwards logs and traces via OpenTelemetry, so you correlate requests using trace_id, span_id, and trace_flags across your monitoring tools.
- Console
- Prompt Files: It allows you to upload files at the Organization and Project level so that the Chat Assistant you define can use them to answer questions.
- Quota Alerts: Email notifications will be sent when a Project or Organization reaches its soft limit, helping teams manage usage proactively.
- Model Configuration Controls: Projects will gain the ability to define which LLMs are enabled, improving governance and cost management.
- Evaluation module backoffice.
- Security consent Flow so you must read and accept Terms and Conditions in Agentic Hub and Console, with identity, date, and time recorded; consent persists across future logins.
- Improved governance with token consumption usage limits so you can set quotas in millions of tokens at Organization and Project level for tighter control and accountability.
- Improved global navigation consistency that introduces a standardized navigation bar across Console, Workspace, and Agentic Hub with unified links, placement, and accessibility.
- Hubs
- Options to export/import Agentic Processes and Flows.
- RAG Assistant and API Assistant will migrate from the Console to Agentic Hub:
- This gives Assistants access to advanced configuration, custom Integrations, and a flexible development workflow in Agentic Hub.
- New RAG Agents so you can connect Agents to Knowledge sources (document repositories, vector stores, databases, or graphs) and configure retrieval Integrations. You link Knowledge to prompts and Integrations, create Agents with templates or via an Assisted flow, and start chatting after uploading documents.
- New RAG‑enabled Agents that let you define Knowledge sources, attach retrieval tools, and chat over documents, databases, vectors, or graphs, with a simple path to migrate from current RAG Assistants.
- Knowledge abstracts your sources.
- Retrieval tools define access, chunking, and prompts.
- Agents tie prompts, tools, and selected Knowledge for seamless RAG.
- Audit Logs: Agentic Hub will begin tracking user actions related to entity creation, updates, and deletions—strengthening traceability and accountability.
- Entity Version Management: Users will be able to view the version history of any Agentic Hub entity and restore previous versions when needed.
- New configuration option in the Tool to generate images.
- New per-project avatar prompt templates with dynamic variables so you can differentiate brand and style across projects while preserving agent name and role; the generator compiles the project prompt with safe defaults and runtime variables.
- New prompt files in agents so you can reference uploaded prompt files directly inside an agent’s prompt to enrich context.
- Security guardrails per agent so you can prevent prompt injection and moderate both inputs and outputs with versioned checks validated against supported providers.
- Import MCP remote servers as Integrations with auto-detected authentication and optional auto-registration.
- Improved autosave behavior so you can keep work safe during connection loss.
- Use an offline/local autosave buffer that syncs when the connection resumes.
- See clear status feedback such as “Saving…”, “Last saved…”, “Saved offline – syncing soon”, or “Save failed – retry”.
- Flows
- Improved BPMN compliance in the agentic process engine that enables throw/catch handling for intermediate signal events, so you coordinate process parts reliably.
- New deterministic Gateway conditions that let you define boolean routing rules using process variables (AND/OR), including natural language entry that the system translates into verifiable expressions you can review and edit.
- Improved editing confidence that adds resilient autosave with a local offline buffer and clear status messages (e.g., Saving…, Last saved…), so you avoid losing work during disconnects or session expiry.
- LLMs
- Improved cost accuracy with conditional handling based on tokens consumed so you can align billing to provider models. Support includes Vertex AI (e.g., Gemini 2.5 family, including audio tokens), cached-token pricing, and regional pricing for providers such as OpenAI and Azure OpenAI.
- Improved cost accuracy that applies provider‑specific conditional pricing based on token counts (including audio and cached tokens) and regional pricing for providers such as Vertex AI, OpenAI, and Azure OpenAI.
- VoiceAgents: Real-Time Conversational AI with Speech
- VoiceAgents introduces real-time, voice-based interactions with AI agents, enabling natural, two-way conversations through speech. Powered by advanced audio transcription, natural language understanding, and text-to-speech synthesis, this feature allows users to speak directly with AI agents and receive immediate spoken responses, bringing human-AI interaction to a whole new level of fluidity and accessibility.