Skip to content

Appendix A: Glossary

This glossary includes core concepts and common terminology in the AI programming field, helping you quickly understand technical documentation and industry discussions.

Basic Concepts

English TermChinese TranslationBrief Description
AI (Artificial Intelligence)Artificial IntelligenceTechnical field enabling machines to simulate human intelligent behavior
ML (Machine Learning)Machine LearningSubset of AI, training models through data rather than explicit programming
DL (Deep Learning)Deep LearningMachine learning approach based on neural networks
LLM (Large Language Model)Large Language ModelLarge-scale neural network models trained on massive text data
NLP (Natural Language Processing)Natural Language ProcessingTechnology enabling computers to understand and generate human language
TransformerTransformer ArchitectureCore architecture of modern LLMs, based on attention mechanism
TokenTokenBasic unit of text processing by models, typically word or subword
EmbeddingEmbedding/VectorizationProcess of converting text to numerical vectors
Fine-tuningFine-tuningContinued training on pre-trained models for specific tasks
Pre-trainingPre-trainingInitial training stage on large-scale data
Context WindowContext WindowMaximum number of tokens a model can process at once
Neural NetworkNeural NetworkComputational model mimicking biological neurons
Training DataTraining DataDataset used to train models
Model ParametersModel ParametersNumber of learnable weights in a neural network

Models and Inference

English TermChinese TranslationBrief Description
GPT (Generative Pre-trained Transformer)Generative Pre-trained TransformerOpenAI's LLM series (GPT-4.1, GPT-5, etc.)
ClaudeClaudeAnthropic's LLM series (Claude Sonnet 4.6, Opus 4.6)
GeminiGeminiGoogle's multimodal LLM series (Gemini 2.5, 3.1 Pro)
DeepSeekDeepSeekOpen-source LLM with excellent performance (V3.2, R1)
TemperatureTemperature ParameterControls output randomness, 0=deterministic, 2=high randomness
Top-p (Nucleus Sampling)Nucleus SamplingSample from candidate words with cumulative probability reaching p
Max TokensMax TokensMaximum length limit for generated responses
InferenceInferenceProcess of using trained model to generate output
Reasoning ModelReasoning ModelModels with enhanced logical reasoning capabilities (e.g., o1)
Chain-of-Thought (CoT)Chain-of-ThoughtTechnique for models to show reasoning process step by step
Few-Shot LearningFew-Shot LearningGuiding model with few examples in prompt
Zero-Shot LearningZero-Shot LearningHaving model execute task without providing examples
MultimodalMultimodalCan process multiple inputs like text, images, audio
Vision ModelVision ModelModels that can understand and analyze images

Development Concepts

English TermChinese TranslationBrief Description
Prompt EngineeringPrompt EngineeringTechnique of designing effective prompts to get desired outputs
Context EngineeringContext EngineeringOptimizing input context to enhance model performance
Vibe CodingVibe CodingProgramming collaboratively with AI through natural language
AI-First DevelopmentAI-First DevelopmentSoftware development paradigm with AI at its core
CopilotCopilotGeneral term for AI-assisted programming tools
Function CallingFunction CallingCapability for LLM to call external functions or APIs
Tool UseTool UseEnabling AI to use external tools to extend capabilities
Structured OutputStructured OutputForcing model output to conform to specific JSON Schema
StreamingStreamingReturning generated results progressively rather than waiting for completion
SSE (Server-Sent Events)Server-Sent EventsHTTP protocol for implementing streaming responses
System PromptSystem PromptInitial instructions defining AI role and behavior
User MessageUser MessageInput content sent to AI by user
Assistant MessageAssistant MessageResponse content returned by AI
Token LimitToken LimitMaximum token count per request
English TermChinese TranslationBrief Description
AI AgentAI AgentAI system capable of autonomous decision-making and task execution
ReAct (Reasoning + Acting)Reasoning-Action PatternFramework where agents alternate between reasoning and action
Multi-AgentMulti-AgentMultiple agents collaborating to complete complex tasks
Sub-AgentSub-AgentSpecialized agents called by main agent
HandoffHandoffTransfer of task control between agents
Computer UseComputer UseAI directly operating computer interface (e.g., Claude)
Agentic AIAgentic AIAI systems with autonomy
PlanningPlanningAgent's ability to formulate task execution plans
MemoryMemoryAgent's ability to store and retrieve historical information
ReflectionReflectionAgent's ability to evaluate and improve own behavior
Autonomous AgentAutonomous AgentCan complete tasks independently without human intervention
Tool AgentTool AgentAgent specialized in calling external tools

Protocols and Infrastructure

English TermChinese TranslationBrief Description
MCP (Model Context Protocol)Model Context ProtocolAI context standard protocol proposed by Anthropic
A2A (Agent-to-Agent)Agent-to-Agent ProtocolAgent communication standard proposed by Google
ANP (Agent Network Protocol)Agent Network ProtocolAgent interconnection standard proposed by OpenAI
RAG (Retrieval-Augmented Generation)Retrieval-Augmented GenerationGeneration technique combining external knowledge bases
Vector DatabaseVector DatabaseSpecialized database for storing and retrieving vector data
Semantic SearchSemantic SearchSearch based on semantic similarity rather than keywords
ChunkingChunkingSplitting long documents into smaller chunks for processing
RerankingRerankingRe-sorting retrieval results by relevance
API GatewayAPI GatewayUnified management and routing of API requests
Load BalancingLoad BalancingDistributing requests across multiple service instances

Production

English TermChinese TranslationBrief Description
GuardrailsGuardrailsMechanisms limiting model output range and quality
EvaluationEvaluationMeasuring model performance and output quality
ObservabilityObservabilityAbility to monitor and debug AI systems
Prompt CachingPrompt CachingCaching common prompts to reduce cost and latency
Batch APIBatch APIAsynchronously batch processing requests to reduce cost
Prompt InjectionPrompt InjectionAttacks where malicious users try to manipulate model behavior
JailbreakJailbreakTechniques to bypass model security restrictions
Rate LimitingRate LimitingLimiting API call frequency to prevent abuse
Cost OptimizationCost OptimizationStrategies to reduce AI application operating costs
Model VersioningModel VersioningManaging deployment and switching of different model versions
A/B TestingA/B TestingComparing effectiveness of different models or prompts
FallbackFallbackBackup plan when main service fails
MonitoringMonitoringReal-time tracking of system running status and metrics
LoggingLoggingRecording request, response, and error information
ResourceDescriptionLink
learn-claude-codeBuild a Claude Code-like agent from scratch in 11 sessionsGitHub
hello-agentsComprehensive guide to building AI agents from zero (Chinese)GitHub
OpenAI CookbookOfficial OpenAI code examples and best practicesGitHub
Anthropic DocsClaude API documentation and prompt engineering guidedocs.anthropic.com
LangChain DocsPopular LLM application framework documentationpython.langchain.com
MCP SpecificationModel Context Protocol official specificationmodelcontextprotocol.io

Last updated: 2026-02-20

An AI coding guide for IT teams