GeniSpace Agent Overview

GeniSpace agents are the core of the platform, powered by advanced large language models (LLMs) with long-term memory capabilities, chain-of-thought reasoning, and multimodal processing — enabling them to assist and automate your work in unprecedented ways.

Creation Process Overview (Aligned with Training Materials)

When creating an agent in the Console, the basic process is as follows:

Console → Agents → Create Agent
Enter prompts: The system prompt defines the agent's role and behavior; prompt templates can define input/output formats
Input schema and output schema: Task agents require JSON Schema-formatted input/output configuration
Enable memory: Configure memory parameters to support multi-turn conversation context
Test and debug: Verify agent behavior in the test area on the configuration page or in the Chat application

Conversational agents can be tested with multi-turn conversations in the Chat application.

Agent Types

GeniSpace provides two different types of agents to serve different use cases:

Conversational Agent (CHAT)

Continuous Dialogue: Supports multi-turn conversations, maintains context, and provides personalized responses
Memory Capabilities: Automatically remembers conversation history and understands long-term context
Personalized Experience: Configurable welcome messages and conversation styles
Use Cases: Customer service, consulting Q&A, creative collaboration

Task Agent (TASK)

Structured Input/Output: Supports JSON Schema validation for data consistency
Strict Mode: Provides input/output validation and strict response formatting
Efficient Execution: Focused on executing specific tasks and delivering results
Use Cases: Data processing, API integration, automated workflows

Core Features

Advanced Chain-of-Thought Reasoning

GeniSpace agents are equipped with advanced chain-of-thought capabilities, giving you clear visibility into the AI's reasoning process:

Visualized Reasoning: Real-time display of the agent's thinking steps and decision-making process
Multi-Round Iteration: Supports step-by-step execution of complex tasks, with up to 15 reasoning iterations
Intelligent Task Planning: Automatically decomposes complex tasks into executable steps
Transparent Decision-Making: Each decision node includes detailed reasoning explanations

Multimodal Processing

Agents support multiple content types for input and processing:

Text Processing: Natural language understanding and generation
Image Recognition: Supports image uploads and visual content analysis
Audio Processing: Understanding and processing of audio files
Mixed Content: Handles composite tasks involving text, images, and audio simultaneously

Intelligent Memory System

Three memory isolation levels are supported, adapting to different use cases:

Session Isolation: Each conversation session has independent memory
User Isolation: Memory is shared across all sessions for the same user
Team Isolation: Memory is shared across all team members

Memory system parameters can be fine-tuned:

Maximum recent conversation messages (10–200)
Historical memory retrieval count (1–20)
Important turns count (3–50)

MCP Tool Ecosystem

Agents integrate powerful tool invocation capabilities:

Built-in Tools

HTML Content Rendering: Display rich text content directly in conversations
Chart Generation Tools: Create various data visualization charts
Table Display Tools: Present structured data in formatted tables
More Built-in Tools: Continuously expanding tool library

Platform Tools

Operator Tools: Access platform-provided operators and user-defined custom operators
- Supports all operators or specifying specific ones
- View operator details, methods, and descriptions
- Automatically distinguishes custom and system operators
Task Tools: Use platform tasks as callable tools
- Supports all tasks or specifying specific ones
- Access SCHEDULED, EVENT, MANUAL, and other task types
- Agents can create and execute tasks
Data Source Tools: Use platform data sources as tools
- Supports all data sources or specifying specific ones
- Access READ, CREATE, UPDATE, DELETE, and other operation types
- Agents can directly query and manipulate data sources
Flexible Configuration: Each tool type supports three selection strategies: "None", "All", and "Specified"

External MCP Servers

Third-Party Integration: Connect to external MCP protocol-compatible services
Custom Tools: Integrate your own developed tools and services
API Key Management: Secure third-party service authentication

Display Plugin System

Agents support a powerful display plugin system for customizing how tool output results are rendered:

Local Plugins

Built-in Plugins: System-built-in plugins such as expense reimbursement, travel expenses, URL redirect, iframe embedding, etc.
Auto-Loading: No configuration needed — automatically available
Rich Functionality: Supports forms, charts, interactive components, and more

Remote Plugins

Dynamic Loading: Dynamically load standalone plugin projects via HTTP URLs
Flexible Deployment: Supports GitHub Pages, CDN, self-hosted servers, enterprise intranets, and more
Independent Development: Plugins can be developed and deployed independently without redeploying the chat system
Version Management: Supports independent versioning and updates
Enterprise Customization: Enterprises can develop custom plugins with full control over UI and interactions

Plugin Features

Auto-Discovery: Supports automatic plugin discovery and registration
Hot Reloading: Supports plugin hot-reload in development mode for improved development efficiency
Fallback Strategy: Automatically falls back to local plugins or default renderers if remote plugins fail to load
Security Controls: Supports plugin URL whitelisting for security

For detailed plugin development and usage guides, see Developer Resources (includes chat plugins).

Web Search Capabilities

When chain-of-thought is enabled, agents can perform real-time web searches:

Real-Time Information Retrieval: Search for the latest web information and data
Search Controls: Configurable maximum search count (1–10) and results per search (1–20)
Intelligent Searching: Automatically constructs search queries based on conversation context

Knowledge Base Integration

Agents can connect to your private knowledge bases:

Document Retrieval: Find relevant information from uploaded documents
Intelligent Matching: Semantic similarity-based matching for the most relevant content
Multi-Knowledge Base Support: Connect to multiple knowledge bases simultaneously

Associate a knowledge base in the agent configuration to enable RAG (Retrieval-Augmented Generation). See Knowledge Base Overview for details.

Agent Use Cases

Intelligent Conversational Assistants

Customer Service: 24/7 automatic responses to customer inquiries with complex requirement understanding
Consulting Advisor: Provide advice and recommendations based on domain expertise
Learning Partner: Assist with learning and research, answering professional questions
Creative Collaboration: Participate in brainstorming and provide creative ideas

Task Automation

Data Processing: Extraction, transformation, and analysis of structured data
Document Generation: Automatically generate reports based on templates and data
API Integration: Act as a middleware layer for handling complex API calls
Workflow Coordination: Chain multiple tools and services to complete complex tasks

Content Creation

Multimedia Content Analysis: Process mixed content including text, images, and audio
Intelligent Editing: Content rewriting, formatting, and optimization
Translation Services: Multilingual content understanding and conversion
Creative Generation: Generate creative content based on input materials

How Agents Work

Chain-of-Thought Execution Flow

Input Understanding: The agent first analyzes the user's multimodal input
Task Planning: Decomposes complex tasks into executable steps
Tool Selection: Intelligently selects appropriate tools based on task requirements
Iterative Execution: Executes tasks step by step, adjusting strategy in real time
Result Consolidation: Aggregates results from each step to generate the final response

Memory Management Mechanism

Recent Conversation Memory: Saves the complete context of the current session
Historical Memory Retrieval: Finds relevant historical interactions through vector search
Important Information Extraction: Automatically identifies and saves key information
Isolation Level Control: Manages memory scope according to the configured isolation level

Streaming Response Processing

Real-Time Output: Thinks and outputs simultaneously — users see progress in real time
Thought Visualization: Displays the agent's reasoning steps and tool calls
Interruption and Resume: Supports user interruption and retry mechanisms
Error Handling: Intelligent retry and error recovery

How to Configure an Agent

The GeniSpace platform provides flexible agent configuration options:

Basic Settings

Agent Type Selection: Choose between conversational or task agent
Basic Information: Set the name, description, and tags
Model Selection: Choose the most suitable AI model from multiple options
Model Parameter Tuning: Configure temperature, maximum tokens, and other parameters

Advanced Configuration

Chain-of-Thought Settings

Enable Chain-of-Thought: Activate advanced reasoning and tool invocation capabilities
Maximum Iterations: Set the iteration limit for complex reasoning (5–50)
Task Planning: Enable intelligent task decomposition

Memory Configuration

Memory Isolation Level: Choose session, user, or team-level memory isolation
Memory Recall Parameters:
- Maximum recent conversation messages (10–200)
- Historical memory retrieval count (1–20)
- Important turns count (3–50)

Tool Configuration

Built-in Tools: Enable HTML rendering, chart generation, and other built-in tools
Platform Tools: Configure access strategies for operators, tasks, and data sources
- Operator Selection: None / All operators / Specified operators
- Task Selection: None / All tasks / Specified tasks
- Data Source Selection: None / All data sources / Specified data sources
External MCP Servers: Integrate third-party tools and services
Display Plugins: Configure rendering plugins for agent output results
- Supports local and remote plugins
- Remote plugins can be dynamically loaded via URL
- Supports auto-discovery and hot reloading

Web Search (Requires Chain-of-Thought)

Search Controls: Set the maximum number of searches (1–10)
Result Count: Configure the number of results per search (1–20)

Knowledge Base Integration

Knowledge Base Selection: Connect relevant knowledge bases
Document Filtering: Select specific document collections

Task Agent-Specific Configuration

Input Schema: Define JSON Schema-formatted input structure
Output Schema: Specify the expected output format and structure
Validation Settings: Enable input/output validation and strict mode

Agent Security & Privacy

GeniSpace places a high priority on data security and privacy protection:

Data Encryption: All data processed by agents is end-to-end encrypted
Access Control: Role-based permission management ensures secure data access
Memory Isolation: Multi-level memory isolation mechanisms protect user data privacy
Tool Security: Security verification and monitoring for MCP tool invocations
Audit Logs: Complete records of agent operations and decision processes
Compliance: Meets major privacy regulations including GDPR and CCPA

API Integration

GeniSpace provides flexible agent APIs supporting multiple integration methods:

Multimodal Chat API

A chat interface supporting text, images, audio, and other content types:

const response = await fetch(`/developer/api-agents/${agentId}/chat`, {
  method: "POST",
  headers: {
    "Authorization": "Bearer YOUR_API_KEY",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    contents: [
      {
        type: "text",
        text: "Please analyze this image"
      },
      {
        type: "image_url",
        image_url: {
          url: "data:image/jpeg;base64,/9j/4AAQSkZ...",
          detail: "auto"
        }
      }
    ],
    session_id: "session_12345",
    stream: true,
    settings: {
      temperature: 0.7,
      max_tokens: 2000
    }
  })
});

Task Execution API

For task agents with structured input/output:

const response = await fetch(`/developer/api-agents/${agentId}/execute`, {
  method: "POST",
  headers: {
    "Authorization": "Bearer YOUR_API_KEY",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    // Provide structured data according to the agent's input schema
    user_input: "Process this sales data",
    data_source: "sales_report_q3.csv",
    output_format: "summary_report",
    
    // Optional session and configuration
    session_id: "task_session_789",
    enable_memory: true,
    enable_thinking_chain: true
  })
});

Streaming Response Processing

GeniSpace agents support real-time streaming responses, allowing you to receive the thinking process and results in real time:

// Handle streaming responses using Server-Sent Events
const eventSource = new EventSource(`/developer/api-agents/${agentId}/chat/stream`);

eventSource.onmessage = function(event) {
  const data = JSON.parse(event.data);
  
  switch(data.type) {
    case 'content_delta':
      // Incremental content update
      appendToMessage(data.content);
      break;
      
    case 'thinking':
      // Chain-of-thought step
      displayThinkingStep(data.step);
      break;
      
    case 'tool_call':
      // Tool call status
      updateToolStatus(data.tool_name, data.status);
      break;
      
    case 'complete':
      // Response complete
      finalizeMessage(data.final_content);
      eventSource.close();
      break;
  }
};

Session Management API

Manage agent conversation sessions:

// Create a new session
const session = await fetch(`/developer/api-agents/${agentId}/sessions`, {
  method: "POST",
  headers: {
    "Authorization": "Bearer YOUR_API_KEY",
    "Content-Type": "application/json"
  },
  body: JSON.stringify({
    title: "Data Analysis Session",
    sessionType: "chat"
  })
});

// Get session history
const history = await fetch(`/developer/api-agents/sessions/${sessionId}/messages`, {
  headers: {
    "Authorization": "Bearer YOUR_API_KEY"
  }
});

// Clear session memory
await fetch(`/developer/api-agents/${agentId}/memory/session/${sessionId}`, {
  method: "DELETE",
  headers: {
    "Authorization": "Bearer YOUR_API_KEY"
  },
  body: JSON.stringify({
    isolation_level: "session"
  })
});

API Feature Summary

Multimodal Support: Simultaneously process text, images, audio, and other content
Streaming Responses: Receive the AI's thinking process and output in real time
Session Persistence: Automatically saves conversation history and context
Memory Management: Supports multi-level memory isolation and clearing
Chain-of-Thought Visualization: Provides detailed reasoning step information
Tool Call Tracking: Monitor tools used by the agent and their status

For detailed API documentation, see Agent API.

Getting Started with Agents

Getting started with GeniSpace agents is straightforward:

Step 1: Create an Agent

Access the Agent Management Page: Navigate to the "Agents" section in the Console
Select Agent Type: Choose conversational or task agent based on your needs
Basic Configuration:
- Set the agent name and description
- Select an appropriate AI model
- Add feature tags

Step 2: Advanced Configuration

Enable Chain-of-Thought: Activate advanced reasoning and tool invocation capabilities
Configure the Memory System: Select memory isolation level and recall parameters
Integrate Tools:
- Enable built-in tools (charts, tables, HTML rendering)
- Configure platform tools:
  - Select operators (all or specified)
  - Select tasks (all or specified)
  - Select data sources (all or specified)
- Configure external MCP servers
- Configure display plugins (local or remote)
Knowledge Base Connection: Upload documents or connect existing knowledge bases
Web Search Settings: Configure search parameters (optional)

Step 3: Test and Deploy

Functional Testing: Test agent functionality directly on the configuration page
Conversation Testing: Validate agent responses through the chat interface
API Integration: Obtain API keys and integrate into your applications
Monitor and Optimize: Observe usage patterns and adjust configuration

Quick Start Templates

We provide multiple pre-configured agent templates:

Customer Service Assistant: Configured with customer service knowledge bases and tools
Data Analyst: Integrated with data processing and visualization tools
Content Creator: Optimized for text generation and editing
Technical Consultant: Connected to technical documentation and problem-solving tools

Getting Help

Documentation Center: View detailed configuration guides and best practices
API Documentation: Refer to the complete Agent API documentation
Community Support: Get help and share experiences in the user forum
Technical Support: Contact our technical team for professional support

Ready to explore more? Check out our Workflow Engine, Tool System, and API Reference to learn how to fully leverage GeniSpace's complete feature set.

Next Steps

Explore more agent features:

Learn about the MCP Tool Invocation System — detailed configuration and usage
Study the Agent Memory System — memory management mechanisms
Master Developer Resources — customize tool output rendering
Explore API Integration — integrate agents into your applications

Related guides:

Workflow Engine: Learn how agents work with workflows
Tool System: Extend agent capabilities
Applications Overview: Learn about sub-applications like Chat

Creation Process Overview (Aligned with Training Materials)​

Agent Types​

Conversational Agent (CHAT)​

Task Agent (TASK)​

Core Features​

Advanced Chain-of-Thought Reasoning​

Multimodal Processing​

Intelligent Memory System​

MCP Tool Ecosystem​

Built-in Tools​

Platform Tools​

External MCP Servers​

Display Plugin System​

Local Plugins​

Remote Plugins​

Plugin Features​

Web Search Capabilities​

Knowledge Base Integration​

Agent Use Cases​

Intelligent Conversational Assistants​

Task Automation​

Content Creation​

How Agents Work​

Chain-of-Thought Execution Flow​

Memory Management Mechanism​

Streaming Response Processing​

How to Configure an Agent​

Basic Settings​

Advanced Configuration​

Chain-of-Thought Settings​

Memory Configuration​

Tool Configuration​

Web Search (Requires Chain-of-Thought)​

Knowledge Base Integration​

Task Agent-Specific Configuration​

Agent Security & Privacy​

API Integration​

Multimodal Chat API​

Task Execution API​

Streaming Response Processing​

Session Management API​

API Feature Summary​

Getting Started with Agents​

Step 1: Create an Agent​

Step 2: Advanced Configuration​

Step 3: Test and Deploy​

Quick Start Templates​

Getting Help​

Next Steps​

Creation Process Overview (Aligned with Training Materials)

Agent Types

Conversational Agent (CHAT)

Task Agent (TASK)

Core Features

Advanced Chain-of-Thought Reasoning

Multimodal Processing

Intelligent Memory System

MCP Tool Ecosystem

Built-in Tools

Platform Tools

External MCP Servers

Display Plugin System

Local Plugins

Remote Plugins

Plugin Features

Web Search Capabilities

Knowledge Base Integration

Agent Use Cases

Intelligent Conversational Assistants

Task Automation

Content Creation

How Agents Work

Chain-of-Thought Execution Flow

Memory Management Mechanism

Streaming Response Processing

How to Configure an Agent

Basic Settings

Advanced Configuration

Chain-of-Thought Settings

Memory Configuration

Tool Configuration

Web Search (Requires Chain-of-Thought)

Knowledge Base Integration

Task Agent-Specific Configuration

Agent Security & Privacy

API Integration

Multimodal Chat API

Task Execution API

Streaming Response Processing

Session Management API

API Feature Summary

Getting Started with Agents

Step 1: Create an Agent

Step 2: Advanced Configuration

Step 3: Test and Deploy

Quick Start Templates

Getting Help

Next Steps