MicrocosmWorksื—ื“ืฉื ื•ืช ื•ืชื›ื ื•ืŸ ืงื•ืกืžื•ืก ื“ื™ื’ื™ื˜ืœื™
ืื•ื“ื•ืชืฆื•ืจ ืงืฉืจ
MicrocosmWorksืžื—ื“ืฉื™ื ื•ืžืชื›ื ื ื™ื ืงื•ืกืžื•ืก ื“ื™ื’ื™ื˜ืœื™

ืžืกืคืงื™ื ืคืชืจื•ื ื•ืช IT ื—ืฉื•ื‘ื™ื. ืื ื• ื ืœื”ื‘ื™ื ืžื˜ื›ื ื•ืœื•ื’ื™ื”, ืื‘ื˜ื—ื” ื•ืขื•ื–ืจื™ื ืœืขืกืงื™ื ืœืฆืžื•ื— ื‘ืืžืฆืขื•ืช ืชืฉืชื™ืช IT ืืžื™ื ื” ื•ื—ื“ืฉื ื™ืช.

[email protected]
+91 7011868196
New Delhi, India

ืžืจื›ื– ืฆืžื™ื—ื” AI

ืžืจื›ื– AIื—ื“ืฉื ื•ืช ืกื˜ืืจื˜ืืคืžืื™ืฅ ืืจื’ื•ื ื™

ืคืชืจื•ื ื•ืช

ื›ืœ ื”ืคืชืจื•ื ื•ืชืืคืœื™ืงืฆื™ื•ืช ื‘ืจื™ืื•ืช ื•ื›ื•ืฉืจืคืœื˜ืคื•ืจืžืช ื•ื™ื“ืื• AIืคื™ืชื•ื— ืกื•ื›ื ื™ AI

ืžืฉืื‘ื™ื

ืชื•ื‘ื ื•ืชืžื“ืจื™ื›ื™ ืชืขืฉื™ื™ื”ืชื•ื›ื ื™ื•ืช ืžืงืจื” ืฉื™ืžื•ืฉืชื‘ื ื™ื•ืช ืืจื›ื™ื˜ืงื˜ื•ืจื”ืžื—ืงืจื™ ืžืงืจื”

ื—ื‘ืจื”

ืื•ื“ื•ืชื™ื ื•ืฆื•ืจ ืงืฉืจื”ืขื‘ื•ื“ื” ืฉืœื ื•

ืฉื™ืจื•ืชื™ื

ื™ื™ืขื•ืฅ ื“ื™ื’ื™ื˜ืœื™ืชืฉืชื™ืช ืขื ืŸืคื™ืชื•ื— SaaSืคื™ืชื•ื— AIื˜ื›ื ื•ืœื•ื’ื™ื™ืช ื•ื™ื“ืื•
ืคื™ืชื•ื— ERPื”ืชืืžื” ืื™ืฉื™ืช ืฉืœ Zohoืคื™ืชื•ื— Odooืื™ื ื˜ื’ืจืฆื™ื” ืฉืœ Salesforceืคื™ืชื•ื— CRM ืžื•ืชืื ืื™ืฉื™ืช
ืื™ื ื˜ื’ืจืฆื™ื” ืฉืœ QuickBooksืคืชืจื•ื ื•ืช IoTืคื™ืชื•ื— ื‘ืœื•ืงืฆ'ื™ื™ืŸ
ื™ื™ืขื•ืฅ ืกื™ื™ื‘ืจืชืžื™ื›ื” ื˜ื›ื ื™ืช - L3

ยฉ 2026 MicrocosmWorks. ื›ืœ ื”ื–ื›ื•ื™ื•ืช ืฉืžื•ืจื•ืช.

ืžื“ื™ื ื™ื•ืช ืคืจื˜ื™ื•ืชืชื ืื™ ืฉื™ืจื•ืช
ื—ื–ืจื” ืœืชื‘ื ื™ื•ืช ืืจื›ื™ื˜ืงื˜ื•ืจื”
AI / DataAdvanced

ืืจื›ื™ื˜ืงื˜ื•ืจืช RAG Pipeline

ื”ืขื ืง ืœ-LLM ืฉืœืš ื’ื™ืฉื” ืœื ืชื•ื ื™ื ืฉืœืš ืœืœื ืฆื•ืจืš ื‘-fine-tuning. RAG ืžื’ืฉืจ ืขืœ ื”ืคืขืจ ื‘ื™ืŸ ืžื•ื“ืœื™ ืฉืคื” ื›ืœืœื™ื™ื ืœื™ื“ืข ืกืคืฆื™ืคื™ ืœืชื—ื•ื.

June 22, 2026
|
2 topics covered
ื“ื™ื•ืŸ ื‘ืืจื›ื™ื˜ืงื˜ื•ืจื” ื–ื•
rag-pipeline-architecture.webp
AI / Data
Category
Advanced
Complexity
Legal, Healthcare
Industries
2+
Technologies

ืžืชื™ ืชืฆื˜ืจืš ื–ืืช

ื‘ืจืฆื•ื ืš ืœื‘ื ื•ืช ืขื•ื–ืจ AI ืฉืขื•ื ื” ืขืœ ืฉืืœื•ืช ื”ื ื•ื’ืขื•ืช ืœืžืกืžื›ื™ ื”ืืจื’ื•ืŸ ืฉืœืš โ€” ื—ื•ื–ื™ื, ืžื“ื™ื ื™ื•ืช, ืžืื’ืจื™ ื™ื“ืข, ืชื™ืขื•ื“ ืžื•ืฆืจ, ืจืฉื•ืžื•ืช ืจืคื•ืื™ื•ืช. ื‘ื™ืฆื•ืข fine-tuning ืœ-LLM ืขืœ ื”ื ืชื•ื ื™ื ืฉืœืš ื™ืงืจ, ืื™ื˜ื™ ื•ื™ื•ืฆืจ ืžื•ื“ืœ ืงืคื•ื ื‘ื ืงื•ื“ืช ื”ืื™ืžื•ืŸ. ืืชื” ื–ืงื•ืง ืœืืจื›ื™ื˜ืงื˜ื•ืจื” ืฉื‘ื” ื”-LLM ื™ื›ื•ืœ ืœื’ืฉืช ืœืžื™ื“ืข ืขื“ื›ื ื™ ื•ืกืคืฆื™ืคื™ ืœืชื—ื•ื ื‘ื–ืžืŸ ืฉืื™ืœืชื”, ืœืฆื˜ื˜ ืืช ืžืงื•ืจื•ืชื™ื•, ื•ืœื”ื™ืžื ืข ืž'ื”ื–ื™ื•ืช' ืขื•ื‘ื“ื•ืช ืฉืื™ื ืŸ ื ืžืฆืื•ืช ื‘ืžืกืžื›ื™ื ืฉืœืš. RAG (Retrieval-Augmented Generation) ื”ื™ื ื”ื“ืจืš ืœื”ื’ื™ืข ืœืฉื.

Related Architecture Patterns

Explore more design patterns and system architectures

ai-ml-pipeline-architecture.webp
AI / Data

ืืจื›ื™ื˜ืงื˜ื•ืจืช Pipeline ืฉืœ AI/ML

ืžื•ื“ืœื™ื ืœื ืžืจื™ืฆื™ื ืืช ืขืฆืžื. ื”-Pipeline ืฉืžื›ืฉื™ืจ, ืžืืžืช, ืคื•ืจืก ื•ืžื ื˜ืจ ืืช ื”ืžื•ื“ืœื™ื ืฉืœืš ื”ื•ื ื”ืžื•ืฆืจ ื”ืืžื™ืชื™ โ€“ ื”ืžื•ื“ืœ ื”ื•ื ืจืง ืชื•ืฆืจ ืื—ื“.

EnterpriseView
scalable-vector-database-architecture.webp

ื”ืื ืืชื” ื–ืงื•ืง ืœืขื–ืจื” ื‘ื”ื˜ืžืขืช ืืจื›ื™ื˜ืงื˜ื•ืจื” ื–ื•?

ืื“ืจื™ื›ืœื™ื ืฉืœื ื• ื™ื›ื•ืœื™ื ืœืขื–ื•ืจ ืœืš ืœืขืฆื‘ ื•ืœื‘ื ื•ืช ืžืขืจื›ื•ืช ืชื•ืš ืฉื™ืžื•ืฉ ื‘ื“ืคื•ืก ื–ื” ืœื“ืจื™ืฉื•ืช ื”ืกืคืฆื™ืคื™ื•ืช ืฉืœืš.

ืฆืจื• ืงืฉืจ

ืกืงื™ืจืช ืชื‘ื ื™ืช

RAG ืžืฉืคืจ ืืช ื™ืฆื™ืจืช ื”-LLM ืขื ื”ืงืฉืจ ืžืื•ื—ื–ืจ ืžืžืื’ืจ ื™ื“ืข. ื‘ื–ืžืŸ ืฉืื™ืœืชื”, ื”ืžืขืจื›ืช ืžืžื™ืจื” ืืช ืฉืืœืช ื”ืžืฉืชืžืฉ ืœ-embedding, ืžื—ืคืฉืช ื‘-vector database ืื—ืจ document chunks ื“ื•ืžื™ื ืกืžื ื˜ื™ืช, ื•ื›ื•ืœืœืช ืืช ื”-chunks ื”ืจืœื•ื•ื ื˜ื™ื™ื ื‘ื™ื•ืชืจ ื›ื”ืงืฉืจ ื‘-LLM prompt. ื–ื” ืžื‘ืกืก ืืช ืชื’ื•ื‘ืช ื”ืžื•ื“ืœ ื‘ืžืกืžื›ื™ื ืืžื™ืชื™ื™ื, ืžืืคืฉืจ ืฆื™ื˜ื•ื˜ ืžืงื•ืจื•ืช, ื•ืฉื•ืžืจ ืขืœ ืžืื’ืจ ื”ื™ื“ืข ื ื™ืชืŸ ืœืขื“ื›ื•ืŸ ืœืœื ืื™ืžื•ืŸ ืžื—ื“ืฉ. RAG pipeline ื‘ืกื‘ื™ื‘ืช ื™ื™ืฆื•ืจ ืžื˜ืคืœ ื‘ื”ื›ื ืกื” (parsing, chunking, embedding), ืื—ื–ื•ืจ (vector search, reranking, hybrid search), ื•ื™ืฆื™ืจื” (prompt construction, streaming, guardrails).

ืืจื›ื™ื˜ืงื˜ื•ืจืช ื™ื™ื—ื•ืก

ืœืืจื›ื™ื˜ืงื˜ื•ืจื” ืฉื ื™ pipelines. ื”-ingestion pipeline ืžืขื‘ื“ ืžืกืžื›ื™ื ื‘ืืžืฆืขื•ืช parsing (ื—ื™ืœื•ืฅ PDF, DOCX, HTML), chunking (ืกืžื ื˜ื™ ืื• ื‘ื’ื•ื“ืœ ืงื‘ื•ืข ืขื ื—ืคื™ืคื”), embedding (ื‘ืืžืฆืขื•ืช embedding model), ื•ืื—ืกื•ืŸ (vector database + document store). ื”-query pipeline ืžืงื‘ืœ ืฉืืœืช ืžืฉืชืžืฉ, ืžื™ื™ืฆืจ query embedding, ืžืื—ื–ืจ candidate chunks ืž-vector database, ืžื“ืจื’ ืื•ืชื ืžื—ื“ืฉ ืœืคื™ ืจืœื•ื•ื ื˜ื™ื•ืช, ื‘ื•ื ื” prompt ืขื ื”-chunks ื”ืžื•ื‘ื™ืœื™ื ื›ื”ืงืฉืจ, ื•ืžื–ืจื™ื ืืช ืชื’ื•ื‘ืช ื”-LLM ืขื ืฆื™ื˜ื•ื˜ื™ ืžืงื•ืจื•ืช.

ืจื›ื™ื‘ื™ ืœื™ื‘ื”
  • Document Ingestion Pipeline: parser ืžืจื•ื‘ื” ืคื•ืจืžื˜ื™ื (Apache Tika, Unstructured, ืื• ืžื•ืชืื ืื™ืฉื™ืช) ืฉืžื—ืœืฅ ื˜ืงืกื˜ ืž-PDFs, DOCX, HTML, Markdown ื•ืชืžื•ื ื•ืช ืกืจื•ืงื•ืช (OCR). ืืกื˜ืจื˜ื’ื™ื™ืช Chunking ืžืคืฆืœืช ืžืกืžื›ื™ื ืœื™ื—ื™ื“ื•ืช ื ื™ืชื ื•ืช ืœืื—ื–ื•ืจ โ€” MW ืžืฉืชืžืฉ ื›ื‘ืจื™ืจืช ืžื—ื“ืœ ื‘-semantic chunking (ืคื™ืฆื•ืœ ื‘ื’ื‘ื•ืœื•ืช ืคืกืงืื•ืช/ืกืขื™ืคื™ื) ืขื ื’ื•ื“ืœ ื™ืขื“ ืฉืœ 512 tokens ื•ื—ืคื™ืคื” ืฉืœ 50 tokens
  • Embedding Service: ืžืžื™ืจ text chunks ืœ-vector embeddings. ืžืฉืชืžืฉ ื‘ืžื•ื“ืœื™ื ื›ืžื• OpenAI text-embedding-3-large, Cohere embed-v4, ืื• ื—ืœื•ืคื•ืช ืงื•ื“ ืคืชื•ื— (BGE, E5). ืขื™ื‘ื•ื“ ืืฆื•ื•ื” ืœื”ื›ื ืกื”, ืขื™ื‘ื•ื“ ืฉืื™ืœืชื” ื™ื—ื™ื“ื” ืœื—ื™ืคื•ืฉ
  • Vector Database: ืžืื—ืกืŸ embeddings ืขื metadata ืœื—ื™ืคื•ืฉ ืžืกื•ื ืŸ. ืชื•ืžืš ื‘ื—ื™ืคื•ืฉ approximate nearest neighbor (ANN) ื‘ืงื ื” ืžื™ื“ื” ืจื—ื‘. ืจืื” Scalable Vector Database Architecture ืœืฉื™ืงื•ืœื™ื ื‘ืงื ื” ืžื™ื“ื” ืฉืœ ื™ื™ืฆื•ืจ
  • ืื—ื–ื•ืจ ื•ื“ื™ืจื•ื’ ืžื—ื“ืฉ (Reranking): ืื—ื–ื•ืจ ื“ื•-ืฉืœื‘ื™ โ€” ื—ื™ืคื•ืฉ ANN ืžื”ื™ืจ ืžื—ื–ื™ืจ 50 ืžื•ืขืžื“ื™ื ืžื•ื‘ื™ืœื™ื, ื•ืœืื—ืจ ืžื›ืŸ reranker ืžืกื•ื’ cross-encoder (Cohere Rerank, BGE Reranker, ืื• ColBERT) ืžื“ืจื’ ื›ืœ ืžื•ืขืžื“ ืžื•ืœ ื”ืฉืื™ืœืชื” ืœื“ื™ืจื•ื’ ืจืœื•ื•ื ื˜ื™ื•ืช ืžื“ื•ื™ืง. 5 ื”-chunks ื”ืžื•ื‘ื™ืœื™ื ืขื•ื‘ืจื™ื ืœ-LLM
  • ื—ื™ืคื•ืฉ ื”ื™ื‘ืจื™ื“ื™ (Hybrid Search): ืžืฉืœื‘ ื—ื™ืคื•ืฉ ื•ืงื˜ื•ืจื™ (semantic) ืขื ื—ื™ืคื•ืฉ ืžื™ืœื•ืช ืžืคืชื— (BM25). ื–ื” ืชื•ืคืก ืžืงืจื™ื ืฉื‘ื”ื ื—ื™ืคื•ืฉ ื•ืงื˜ื•ืจื™ ืžืคืกืคืก ื˜ืจืžื™ื ื•ืœื•ื’ื™ื” ืžื“ื•ื™ืงืช (ืงื•ื“ื™ ืžื•ืฆืจ, ืกืขื™ืคื™ื ืžืฉืคื˜ื™ื™ื, ืžื•ื ื—ื™ื ืจืคื•ืื™ื™ื) ืฉื—ื™ืคื•ืฉ ืžื™ืœื•ืช ืžืคืชื— ืžื˜ืคืœ ื‘ื”ื ื”ื™ื˜ื‘. ืžื™ื–ื•ื’ ื“ื™ืจื•ื’ ื”ื“ื“ื™ (Reciprocal rank fusion) ืžืื—ื“ ืืช ืฉืชื™ ืงื‘ื•ืฆื•ืช ื”ืชื•ืฆืื•ืช

ื”ื—ืœื˜ื•ืช ืขื™ืฆื•ื‘ ื•ืคืฉืจื•ืช

ืืกื˜ืจื˜ื’ื™ื™ืช Chunking: ื’ื•ื“ืœ ืงื‘ื•ืข (Fixed-Size) ืžื•ืœ ืกืžื ื˜ื™ (Semantic) ืžื•ืœ ืžื‘ื ื” ืžืกืžืš (Document-Structure)
Chunking ื‘ื’ื•ื“ืœ ืงื‘ื•ืข (ืคื™ืฆื•ืœ ื›ืœ N tokens) ืคืฉื•ื˜ ืืš ืฉื•ื‘ืจ ื‘ืืžืฆืข ืžืฉืคื˜ ื•ืžืื‘ื“ ืืช ืžื‘ื ื” ื”ืžืกืžืš. Semantic chunking (ืคื™ืฆื•ืœ ื‘ื’ื‘ื•ืœื•ืช ื˜ื‘ืขื™ื™ื โ€” ืคืกืงืื•ืช, ืกืขื™ืคื™ื, ื›ื•ืชืจื•ืช) ืžืฉืžืจ ื”ืงืฉืจ ืืš ืžื™ื™ืฆืจ chunks ื‘ื’ื“ืœื™ื ืžืฉืชื ื™ื. Document-structure chunking (ื”ืชื—ืฉื‘ื•ืช ื‘ื”ื™ืจืจื›ื™ื™ืช ื”ืžืกืžืš โ€” ืคืจืงื™ื, ืกืขื™ืคื™ื, ืชืช-ืกืขื™ืคื™ื) ื”ื•ื ื”ื˜ื•ื‘ ื‘ื™ื•ืชืจ ืขื‘ื•ืจ ืžืกืžื›ื™ื ืžื•ื‘ื ื™ื ื›ืžื• ื—ื•ื–ื™ื ืžืฉืคื˜ื™ื™ื ืื• ืžื“ืจื™ื›ื™ื ื˜ื›ื ื™ื™ื. MW ืžืฉืชืžืฉ ื›ื‘ืจื™ืจืช ืžื—ื“ืœ ื‘-semantic chunking ื•ืขื•ื‘ืจ ืœ-document-structure ืขื‘ื•ืจ ืžืงื•ืจื•ืช ื‘ืขืœื™ ืคื•ืจืžื˜ ืžื•ืจื›ื‘.
ื—ื™ืคื•ืฉ ื•ืงื˜ื•ืจื™ (Vector Search) ืžื•ืœ ื—ื™ืคื•ืฉ ื”ื™ื‘ืจื™ื“ื™ (Hybrid Search)
ื—ื™ืคื•ืฉ ื•ืงื˜ื•ืจื™ ื˜ื”ื•ืจ ืขื•ื‘ื“ ื”ื™ื˜ื‘ ืขื‘ื•ืจ ืฉืื™ืœืชื•ืช ืฉื™ื—ืชื™ื•ืช ("ืื™ืš ืื ื™ ืžื˜ืคืœ ื‘ื”ื—ื–ืจื™ื ื›ืกืคื™ื™ื?") ืืš ื ื›ืฉืœ ื‘ืฉืื™ืœืชื•ืช ื”ืชืืžื” ืžื“ื•ื™ืงืช ("ืžื”ื• ืกืขื™ืฃ 7.3.2?"). ื—ื™ืคื•ืฉ ื”ื™ื‘ืจื™ื“ื™ (ื•ืงื˜ื•ืจื™ + ืžื™ืœืช ืžืคืชื— BM25) ืžื˜ืคืœ ื‘ืฉื ื™ื”ื. MW ืžืžืœื™ืฅ ืขืœ ื—ื™ืคื•ืฉ ื”ื™ื‘ืจื™ื“ื™ ืขื‘ื•ืจ ื›ืœ ืชื—ื•ื ืขื ื˜ืจืžื™ื ื•ืœื•ื’ื™ื”, ืงื•ื“ื™ื ืื• ืžื–ื”ื™ื ืกืคืฆื™ืคื™ื™ื โ€” ืฉื–ื” ืจื•ื‘ ื”ืชื—ื•ืžื™ื ื”ืืจื’ื•ื ื™ื™ื. ื”ืžื•ืจื›ื‘ื•ืช ื”ื ื•ืกืคืช ืฉืœ 10-15% ืฉื•ื•ื” ืืช ืฉื™ืคื•ืจ ื”ืจืœื•ื•ื ื˜ื™ื•ืช ื”ืžืฉืžืขื•ืชื™.
Reranking: Cross-Encoder ืžื•ืœ ืœืœื (None)
Reranking ืžืกื•ื’ cross-encoder ืžื•ืกื™ืฃ ื”ืฉื”ื™ื” ืฉืœ 100-300ms ืืš ืžืฉืคืจ ื‘ืื•ืคืŸ ื“ืจืžื˜ื™ ืืช ื“ื™ื•ืง ื”ืื—ื–ื•ืจ โ€” ืžื“ื“ื ื• ืฉื™ืคื•ืจ ืฉืœ 15-25% ื‘ืจืœื•ื•ื ื˜ื™ื•ืช ืฉืœ 5 ื”ืžื•ื‘ื™ืœื™ื ื‘ืชื—ื•ืžื™ ื”ืžืฉืคื˜ ื•ื”ื‘ืจื™ืื•ืช. MW ื›ื•ืœืœ reranking ื›ื‘ืจื™ืจืช ืžื—ื“ืœ ืขื‘ื•ืจ ื›ืœ ืžืขืจื›ืช RAG ืฉื‘ื” ืื™ื›ื•ืช ื”ืชืฉื•ื‘ื” ื—ืฉื•ื‘ื” ื™ื•ืชืจ ืžื”ืฉื”ื™ื” ืฉืœ ืคื—ื•ืช ืžืฉื ื™ื™ื”. ืขื‘ื•ืจ ืฆ'ืื˜ื‘ื•ื˜ื™ื ืฉื‘ื”ื ื”ืžื”ื™ืจื•ืช ืงืจื™ื˜ื™ืช, ืื ื• ืžื“ืœื’ื™ื ืขืœ reranking ื•ืžืคืฆื™ื ืขื chunking ื•-prompt engineering ื˜ื•ื‘ื™ื ื™ื•ืชืจ.
ื•ืงื˜ื•ืจ ื™ื—ื™ื“ (Single-Vector) ืžื•ืœ ืจื‘-ื•ืงื˜ื•ืจื™ (Multi-Vector) (ื‘ืกื’ื ื•ืŸ ColBERT)
Embeddings ืžืกื•ื’ ื•ืงื˜ื•ืจ ื™ื—ื™ื“ ืคืฉื•ื˜ื™ื ื•ื–ื•ืœื™ื ื™ื•ืชืจ ืœืื—ืกื•ืŸ/ื—ื™ืคื•ืฉ. ื™ื™ืฆื•ื’ื™ื ืจื‘-ื•ืงื˜ื•ืจื™ื™ื (ื•ืงื˜ื•ืจ ืื—ื“ ืœื›ืœ token, ื ื™ืงื•ื“ ืื™ื ื˜ืจืืงืฆื™ื” ืžืื•ื—ืจืช) ืœื•ื›ื“ื™ื ื™ื•ืชืจ ื ื™ื•ืื ืกื™ื ืืš ื“ื•ืจืฉื™ื ืชืฉืชื™ืช ืžื™ื•ื—ื“ืช. MW ืžืฉืชืžืฉ ื‘ื•ืงื˜ื•ืจ ื™ื—ื™ื“ ืขื‘ื•ืจ ืจื•ื‘ ื”ืคืจื™ืกื•ืช ื•ืฉื•ืžืจ ื™ื™ืฆื•ื’ื™ื ืจื‘-ื•ืงื˜ื•ืจื™ื™ื ืœืชื—ื•ืžื™ื ืฉื‘ื”ื ืื™ื›ื•ืช ื”ืื—ื–ื•ืจ ื”ื™ื ืฆื•ื•ืืจ ื”ื‘ืงื‘ื•ืง ื•ืงื•ืจืคื•ืก ื”ืžืกืžื›ื™ื ืขื•ืœื” ืขืœ 100K chunks.

ื‘ื—ื™ืจื•ืช ื˜ื›ื ื•ืœื•ื’ื™ื•ืช

ืฉื›ื‘ื”ื˜ื›ื ื•ืœื•ื’ื™ื•ืช
ื ื™ืชื•ื— ืžืกืžื›ื™ื (Document Parsing)Unstructured, Apache Tika, LlamaParse, Docling, custom OCR (Tesseract, AWS Textract)
EmbeddingOpenAI text-embedding-3-large, Cohere embed-v4, BGE-M3, E5-large-v2
Vector DatabaseMilvus, Pinecone, Qdrant, Weaviate, pgvector (for small-scale)
ื—ื™ืคื•ืฉ ืžื™ืœื•ืช ืžืคืชื— (Keyword Search)Elasticsearch, OpenSearch, PostgreSQL full-text search
RerankingCohere Rerank, BGE Reranker, ColBERT v2, FlashRank
LLMClaude (via AI Gateway), GPT-4, Gemini โ€” ืื’ื ื•ืกื˜ื™ ืœืกืคืง ื‘ืืžืฆืขื•ืช AI SDK
ืื•ืจืงืกื˜ืจืฆื™ื” (Orchestration)LangChain, LlamaIndex, ืื• custom pipeline (ื”ืขื“ืคื” ืฉืœ MW ืœืกื‘ื™ื‘ืช ื™ื™ืฆื•ืจ)

ืžืชื™ ืœื”ืฉืชืžืฉ / ืžืชื™ ืœื”ื™ืžื ืข

ื”ืฉืชืžืฉ ื›ืืฉืจื”ื™ืžื ืข ื›ืืฉืจ
ืžืฉืชืžืฉื™ื ื–ืงื•ืงื™ื ืœืชืฉื•ื‘ื•ืช ื”ืžื‘ื•ืกืกื•ืช ืขืœ ืžืกืžื›ื™ื ืกืคืฆื™ืคื™ื™ื ืฉืœ ื”ืืจื’ื•ืŸ ืฉืœืšืžืื’ืจ ื”ื™ื“ืข ื”ื•ื ืคื—ื•ืช ืž-50 ืขืžื•ื“ื™ื โ€” ืคืฉื•ื˜ ื”ื›ื ืก ืื•ืชื• ืœ-system prompt
ืžืกืžื›ื™ื ืžืชืขื“ื›ื ื™ื ืœืขื™ืชื™ื ืงืจื•ื‘ื•ืช ื•ื”-AI ื–ืงื•ืง ืœืžื™ื“ืข ืขื“ื›ื ื™ืืชื” ืฆืจื™ืš ืฉื”ืžื•ื“ืœ ื™ืœืžื“ ืžื™ื•ืžื ื•ืช/ื”ืชื ื”ื’ื•ืช ื—ื“ืฉื”, ืœื ื™ื’ืฉ ืœืขื•ื‘ื“ื•ืช ื—ื“ืฉื•ืช (ื‘ืฆืข fine-tune ื‘ืžืงื•ื)
ืฆื™ื˜ื•ื˜ ืžืงื•ืจื•ืช ื•ื™ื›ื•ืœืช ื‘ื™ืงื•ืจืช ื”ื ื“ืจื™ืฉื•ืช (ืžืฉืคื˜ื™, ืฆื™ื•ืช, ื‘ืจื™ืื•ืช)ื”ืฉืืœื•ืช ื”ืŸ ืฉื™ื—ืชื™ื•ืช ื‘ืœื‘ื“ ื•ืื™ื ืŸ ื“ื•ืจืฉื•ืช ื‘ื™ืกื•ืก ืขื•ื‘ื“ืชื™
ืงื‘ื•ืฆื•ืช ืžืฉืชืžืฉื™ื ืžืจื•ื‘ื•ืช ื–ืงื•ืงื•ืช ืœื’ื™ืฉื” ืœืชืช-ืงื‘ื•ืฆื•ืช ืžืกืžื›ื™ื ืฉื•ื ื•ืช (RAG ืžืกื•ื ืŸ ื”ืจืฉืื•ืช)ืืชื” ื‘ื•ื ื” ื›ืœื™ ื›ืชื™ื‘ื” ื™ืฆื™ืจืชื™ืช ืฉื‘ื• ื“ื™ื•ืง ืขื•ื‘ื“ืชื™ ืื™ื ื• ื”ืžื˜ืจื”

ื”ื’ื™ืฉื” ืฉืœื ื•

MW ื‘ื•ื ื” RAG pipelines ืžืื™ื›ื•ืช ื”ืื—ื–ื•ืจ ื›ืœืคื™ ื—ื•ืฅ โ€” ืื ื• ืžื•ื“ื“ื™ื ืืช ื“ื™ื•ืง ื”ืื—ื–ื•ืจ ืœืคื ื™ ื ื’ื™ืขื” ื‘-LLM prompt. ืžืขืจื›ืช RAG ืขื ืื—ื–ื•ืจ ื‘ื™ื ื•ื ื™ ื•-LLM ืžืขื•ืœื” ืžื™ื™ืฆืจืช ืชืฉื•ื‘ื•ืช ืฉื’ื•ื™ื•ืช ืฉื ืฉืžืขื•ืช ื‘ื˜ื•ื—ื•ืช. ื”-pipeline ื”ืกื˜ื ื“ืจื˜ื™ ืฉืœื ื• ื›ื•ืœืœ ื›ืœื™ ื”ืขืจื›ืช ืื—ื–ื•ืจ: ืงื‘ื•ืฆื” ืฉืœ ืฉืื™ืœืชื•ืช ื‘ื“ื™ืงื” ืขื ืžืกืžื›ื™ื ืจืœื•ื•ื ื˜ื™ื™ื ื™ื“ื•ืขื™ื, ื”ื ืžื“ื“ื™ื ืœืคื™ MRR@5 ื•-NDCG@10. ืื ื• ืžื‘ืฆืขื™ื ืื™ื˜ืจืฆื™ื•ืช ืขืœ chunking, embedding model ื•-reranking ืขื“ ืฉืžื“ื“ื™ ื”ืื—ื–ื•ืจ ืžื’ื™ืขื™ื ืœืกืคื™ ื”ื™ืขื“ ืœืคื ื™ ืื•ืคื˜ื™ืžื™ื–ืฆื™ื” ืฉืœ ื™ืฆื™ืจื”. ื‘ื ื™ื ื• ืžืขืจื›ื•ืช RAG ืœืกืงื™ืจืช ืžืกืžื›ื™ื ืžืฉืคื˜ื™ื™ื, ืžืื’ืจื™ ื™ื“ืข ื‘ืชื—ื•ื ื”ื‘ืจื™ืื•ืช ื•ืชืžื™ื›ืช ืœืงื•ื—ื•ืช ืจื‘-ืœืฉื•ื ื™ืช โ€” ื•ื”ืœืงื— ื”ืžืฉื•ืชืฃ ื”ื•ื ืฉืื™ื›ื•ืช ื”ืื—ื–ื•ืจ ืžื”ื•ื•ื” 80% ืžืื™ื›ื•ืช ื”ืชืฉื•ื‘ื”.

ืชื•ื›ื ื™ื•ืช ืงืฉื•ืจื•ืช

  • AI Customer Support Agent โ€” ืกื•ื›ืŸ ืชืžื™ื›ื” ืžื‘ื•ืกืก RAG ืขื ืื—ื–ื•ืจ ืžืžืื’ืจ ื™ื“ืข
  • AI Document Processing Pipeline โ€” ื”ื›ื ืกืช ืžืกืžื›ื™ื, ื ื™ืชื•ื— ื•ื—ื™ืœื•ืฅ ืžื‘ื•ืกืก AI

ืžื“ืจื™ื›ื™ื ืชืขืฉื™ื™ืชื™ื™ื ืงืฉื•ืจื™ื

  • AI for Legal โ€” ื™ื™ืฉื•ืžื™ RAG ื‘ืกืงื™ืจืช ื—ื•ื–ื™ื ื•ืžื—ืงืจ ืžืฉืคื˜ื™

ืžืงืจื™ ื‘ื•ื—ืŸ ืงืฉื•ืจื™ื

  • Document Intelligence โ€” RAG pipeline ืžืงื•ืžื™ ืœื ื™ืชื•ื— ื’ื™ืœื™ื•ื ื•ืช ืืœืงื˜ืจื•ื ื™ื™ื ื•ืžืกืžื›ื™ื
  • AI Chat Platform โ€” ืฆ'ืื˜ ืžืจื•ื‘ื” ืžื•ื“ืœื™ื ืขื ืื—ื–ื•ืจ ืžืกืžื›ื™ื ื•ื˜ื™ืคื•ืœ ื‘ื ืชื•ื ื™ื ืชื•ืื GDPR
Related Technologies
AI DevelopmentSaaS Development
AI / Data

ืืจื›ื™ื˜ืงื˜ื•ืจื” ืฉืœ ื‘ืกื™ืก ื ืชื•ื ื™ื ื•ืงื˜ื•ืจื™ ืžื“ืจื’ื™

ื—ื™ืคื•ืฉ ื”ื˜ืžืขื•ืช ืงืœ ืขื‘ื•ืจ 10K ื•ืงื˜ื•ืจื™ื. ืขื‘ื•ืจ 100M ื•ืงื˜ื•ืจื™ื ืขื P99 ื”ื ืžื•ืš ืž-100ms, ื–ื• ื‘ืขื™ื™ืช ืชืฉืชื™ืช โ€” ื•ื–ื• ื”ื‘ืขื™ื” ืฉื”ืชื‘ื ื™ืช ื”ื–ื• ืคื•ืชืจืช.

EnterpriseView
cloud-native-infrastructure.webp
Infrastructure

ืชืฉืชื™ืช Cloud-Native

ืชืฉืชื™ืช ืฉืžื ื•ื”ืœืช ื‘ื’ืจืกืื•ืช, ื ื‘ื“ืงืช ื•ื ืคืจืกืช ื›ืžื• ืงื•ื“ ื™ื™ืฉื•ื โ€” ื›ื™ ื”ืคืœื˜ืคื•ืจืžื” ืฉืœืš ืืžื™ื ื” ืจืง ื›ืžื• ืžื” ืฉื ืžืฆื ืžืชื—ืชื™ื”.

EnterpriseView

ืฉืืœื•ืช ื ืคื•ืฆื•ืช

MicrocosmWorks ืžื™ื™ืฉืžืช ื™ื™ืฉื•ื‘ ืงื•ื ืคืœื™ืงื˜ื™ื ื‘ืฆื™ื ื•ืจื•ืช RAG ื‘ืืžืฆืขื•ืช ื“ื™ืจื•ื’ ืกืžื›ื•ืช ืžืงื•ืจ, ืฉืงื™ืœืช ืขื“ื›ื ื™ื•ืช ืžื‘ื•ืกืกืช ื—ื•ืชืžืช ื–ืžืŸ, ื•ื ื™ืงื•ื“ ื‘ื™ื˜ื—ื•ืŸ ื”ืžืขืจื™ืš ืขื“ ื›ืžื” ื›ืœ ืงื˜ืข ืžืื•ื—ื–ืจ ืชื•ืžืš ื‘ื˜ืขื ืชื•. ื›ืืฉืจ ืžืื•ื—ื–ืจื™ื ืงื˜ืขื™ื ืกื•ืชืจื™ื, ื”ืฆื™ื ื•ืจ ืฉืœื ื• ืžืฆื™ื’ ืืช ื”ืชืฉื•ื‘ื” ื‘ืขืœืช ื”ืกืžื›ื•ืช ื”ื’ื‘ื•ื”ื” ื‘ื™ื•ืชืจ ืชื•ืš ื”ืฆื’ื” ืฉืงื•ืคื” ืฉืœ ืื™ ื”ื”ืกื›ืžื” ื•ืฆื™ื˜ื•ื˜ื™ ื”ืžืงื•ืจื•ืช, ื›ืš ืฉืžืฉืชืžืฉื™ื ื™ื•ื›ืœื• ืœืงื‘ืœ ื”ื—ืœื˜ื•ืช ืžื•ืฉื›ืœื•ืช. ืื ื• ื‘ื•ื ื™ื ื’ื ืœื•ืœืื•ืช ืžืฉื•ื‘ ืฉื‘ื”ืŸ ืžื•ืžื—ื™ ืชื—ื•ื ื™ื›ื•ืœื™ื ืœืกืžืŸ ื™ื™ืฉื•ื‘ื™ื ืฉื’ื•ื™ื™ื, ืžื” ืฉืžืฉืคืจ ืืช ื“ื™ืจื•ื’ ื”ืื—ื–ื•ืจ ืœืื•ืจืš ื–ืžืŸ.

MicrocosmWorks ืžืฉืชืžืฉืช ื‘-chunking ืžื•ื“ืข-ืชื•ื›ืŸ ืฉืžื™ื™ืฉื ืืกื˜ืจื˜ื’ื™ื•ืช ืฉื•ื ื•ืช ื‘ื”ืชื‘ืกืก ืขืœ ืžื‘ื ื” ื”ืžืกืžืš โ€“ ืคื™ืฆื•ืœ ืคืกืงืื•ืช ืกืžื ื˜ื™ ืœืคืจื•ื–ื”, chunking ื‘ืจืžืช ืฉื•ืจื” ืื• ื‘ืจืžืช ืžืงื˜ืข ืขื‘ื•ืจ ื˜ื‘ืœืื•ืช ืขื ืฉื™ืžื•ืจ ื”ืงืฉืจ ืฉืœ ื”ื›ื•ืชืจื•ืช, ื•-chunking ื‘ืจืžืช ืคื•ื ืงืฆื™ื” ืขื‘ื•ืจ ืงื•ื“ ืขื ื”ืฆื”ืจื•ืช import ืžืฆื•ืจืคื•ืช. ืื ื• ืžืขืฉื™ืจื™ื ื›ืœ chunk ื‘-metadata ื”ื›ื•ืœืœื™ื ื›ื•ืชืจืช ืžืกืžืš, ื”ื™ืจืจื›ื™ื™ืช ืžืงื˜ืขื™ื ื•ืกื•ื’ ืชื•ื›ืŸ, ื›ืš ืฉืฉืœื‘ ื”ืฉืœื™ืคื” ื™ื•ื›ืœ ืœื™ื™ืฉื ื ื™ืงื•ื“ ืกืคืฆื™ืคื™ ืœืกื•ื’. ื’ื™ืฉื” ื–ื• ืขื•ืœื” ื‘ืื•ืคืŸ ืขืงื‘ื™ ื‘ื‘ื™ืฆื•ืขื™ื” ืขืœ chunking ื ืื™ื‘ื™ ื‘ื’ื•ื“ืœ ืงื‘ื•ืข ื‘-25-40% ื‘ืžื“ื“ื™ ืจืœื•ื•ื ื˜ื™ื•ืช ืฉืœื™ืคื” ื‘ืคืจื•ื™ืงื˜ื™ ื”ืœืงื•ื—ื•ืช ืฉืœื ื•.

MicrocosmWorks ื‘ื•ื ื” ืžืขืจื›ื•ืช ื”ืขืจื›ื” ื”ื‘ื•ื“ืงื•ืช ืคื™ื™ืคืœื™ื™ื ื™ื ืฉืœ RAG ืขืœ ืคื ื™ ืฉืœื•ืฉื” ืžืžื“ื™ื: ืจืœื•ื•ื ื˜ื™ื•ืช ืื—ื–ื•ืจ (ื”ืื ื”-'ืฆ'ืื ืงื™ื' ื”ื ื›ื•ื ื™ื ื ืžืฆืื™ื), ื ืืžื ื•ืช ื”ืชืฉื•ื‘ื” (ื”ืื ื”ืชืฉื•ื‘ื” ืฉื ื•ืฆืจื” ืื›ืŸ ืžืฉืงืคืช ืืช ื”ืชื•ื›ืŸ ืฉืื•ื—ื–ืจ), ื•ืฉืœืžื•ืช ื”ืชืฉื•ื‘ื” (ื”ืื ื”ื™ื ืžืชื™ื™ื—ืกืช ืœืฉืืœื” ื”ืžืœืื”). ืื ื• ื™ื•ืฆืจื™ื ืžืขืจื›ื™ ื‘ื“ื™ืงื” 'ื’ื•ืœื“ืŸ' ืขื ืžื•ืžื—ื™ ืชื—ื•ื ื”ื›ื•ืœืœื™ื ืฉืื™ืœืชื•ืช ืขื ืชืฉื•ื‘ื•ืช ื™ื“ื•ืขื•ืช, ืžืงืจื™ ืงืฆื” ืขื•ื™ื ื™ื, ื•ืฉืืœื•ืช ื”ื“ื•ืจืฉื•ืช ืฉื™ืœื•ื‘ ืžื™ื“ืข ืžืžืกืคืจ ืžืกืžื›ื™ื. ื”ืขืจื›ื” ื–ื• ืจืฆื” ืื•ื˜ื•ืžื˜ื™ืช ื‘-CI/CD ื›ืš ืฉื›ืœ ืฉื™ื ื•ื™ ื‘ืคื™ื™ืคืœื™ื™ืŸ ื ืžื“ื“ ืืœ ืžื•ืœ ืžื“ื“ื™ ืื™ื›ื•ืช ื‘ืกื™ืกื™ื™ื ืœืคื ื™ ื”ืคืจื™ืกื”.

MicrocosmWorks ื‘ื•ื—ืจืช ืžืื’ืจื™ ื•ืงื˜ื•ืจื™ื ื‘ื”ืชื‘ืกืก ืขืœ ืงื ื” ื”ืžื™ื“ื” ืฉืœื›ื, ืชื‘ื ื™ืช ื”ืฉืื™ืœืชื•ืช ื•ื“ืจื™ืฉื•ืช ื”ืชืคืขื•ืœโ€”Pinecone ืœืคืฉื˜ื•ืช ืžื ื•ื”ืœืช, Weaviate ืœื—ื™ืคื•ืฉ ื”ื™ื‘ืจื™ื“ื™ ืฉืœ ืžื™ืœื•ืช ืžืคืชื—-ื•ืงื˜ื•ืจื™ื, pgvector ืœืฆื•ื•ืชื™ื ืฉื›ื‘ืจ ื”ืฉืงื™ืขื• ื‘-PostgreSQL, ื•-Qdrant ืœืคืจื™ืกื•ืช ืขืฆืžืื™ื•ืช ืขื ืชืคื•ืงื” ื’ื‘ื•ื”ื”. ื‘ืกืงืœื•ืช ืžืชื—ืช ืœ-10 ืžื™ืœื™ื•ืŸ ื•ืงื˜ื•ืจื™ื, ืจื•ื‘ ื”ืืคืฉืจื•ื™ื•ืช ืžืกืคืงื•ืช ื–ืžืŸ ืื—ื–ื•ืจ ืฉืœ ืคื—ื•ืช ืž-100ms, ืืš ื”ื”ื‘ื“ืœื™ื ื”ื•ืคื›ื™ื ืœืžืฉืžืขื•ืชื™ื™ื ื‘ืžืื•ืช ืžื™ืœื™ื•ื ื™ ื•ืงื˜ื•ืจื™ื ืฉื‘ื”ื ืกื•ื’ ื”ืื™ื ื“ืงืก, ืงื•ื•ื ื˜ื™ื–ืฆื™ื” ื•ืืกื˜ืจื˜ื’ื™ื™ืช ื—ืœื•ืงื” ื—ืฉื•ื‘ื™ื ื‘ืžื™ื“ื” ืขืฆื•ืžื”. ืื ื• ืžื‘ืฆืขื™ื ื‘ื ืฆ'ืžืจืง ืœืžืžื“ื™ ื”ื”ื˜ื‘ืขื” ื”ืืžื™ืชื™ื™ื ืฉืœื›ื ื•ืœืชื‘ื ื™ื•ืช ื”ืฉืื™ืœืชื•ืช ืžื•ืœ ื”ืืคืฉืจื•ื™ื•ืช ืฉื ื‘ื—ืจื• ื‘ืงืฆืจื” ื‘ืžื”ืœืš ืฉืœื‘ ืชื›ื ื•ืŸ ื”ืืจื›ื™ื˜ืงื˜ื•ืจื” ืฉืœื ื•.

MicrocosmWorks ื‘ื•ื ื” ืฆื™ื ื•ืจื•ืช ื”ื–ื ื” ืžืฆื˜ื‘ืจื™ื (incremental ingestion pipelines) ืฉืžื ื˜ืจื™ื ืžืื’ืจื™ ืžืกืžื›ื™ ืžืงื•ืจ (source document repositories) ืœืฉื™ื ื•ื™ื™ื, ืžื‘ืฆืขื™ื re-chunk ื•-re-embed ืจืง ืฉืœ ื”ืกืขื™ืคื™ื ืฉื”ืฉืชื ื•, ื•ืžืขื“ื›ื ื™ื ืืช ื”-vector store ืœืœื ืฆื•ืจืš ื‘-reindex ืžืœื. ืื ื• ืžื™ื™ืฉืžื™ื document fingerprinting ืฉืžื–ื”ื” ืฉื™ื ื•ื™ื™ื ื‘ืชื•ื›ืŸ ื‘ืจืžืช ื”ืกืขื™ืฃ (section level), ื›ืš ืฉืขืจื™ื›ื” ืฉืœ ืคืกืงื” ืื—ืช ืื™ื ื” ืžืคืขื™ืœื” ืขื™ื‘ื•ื“ ืžื—ื“ืฉ (reprocessing) ืฉืœ ืžืกืžืš ืฉืœื ื‘ืŸ 200 ืขืžื•ื“ื™ื. ืขื‘ื•ืจ ืœืงื•ื—ื•ืช ืขื ื“ืจื™ืฉื•ืช real-time freshness, ืื ื• ืžื•ืกื™ืคื™ื ืฉื›ื‘ืช live retrieval ืฉืžื‘ืฆืขืช ืฉืื™ืœืชื•ืช ื™ืฉื™ืจื•ืช ืœืžืขืจื›ืช ื”ืžืงื•ืจ ืขื‘ื•ืจ ืžืกืžื›ื™ื ืฉืฉื•ื ื• ืœืื—ืจื•ื ื” ื•ืžืžื–ื’ืช ืืช ื”ืชื•ืฆืื•ืช ื”ืœืœื• ืขื vector search hits.