Retrieval Augmented Generation (RAG) Explained [Translation]

Author: AIHGF
Published: January 21, 2024
Category: Generative AI

https://twitter.com/akshay_pachaar/status/1748322389321265261

Retrieval Augmented Generation (RAG) is a powerful technique for improving the performance of LLMs (Large Language Models) by integrating additional knowledge into the generation process.

As illustrated in the original post's diagram, a RAG system consists of the following components:

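1. Custom Knowledge Base

A custom knowledge base is a collection of relevant, up-to-date information that serves as the foundation for RAG. It can be a database, a set of documents, and so on.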

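2. Chunking

Chunking is the process of breaking large input text into smaller pieces so that each piece fits within the input size of the embedding model and retrieval becomes more efficient.

A suitable chunking strategy can greatly improve a RAG system.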
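To make the idea concrete, here is a minimal sketch of one common strategy: fixed-size chunks with overlap. The function name `chunk_text`, the word-based splitting, and the default sizes are assumptions chosen for illustration; the original post does not prescribe a particular strategy, and real systems often split on tokens, sentences, or document structure instead.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into chunks of at most `chunk_size` words.

    Consecutive chunks share `overlap` words so that a sentence cut at a
    chunk boundary still appears intact in one of the two chunks.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # this window already reached the end of the text
    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(10))
    for chunk in chunk_text(sample, chunk_size=4, overlap=1):
        print(chunk)
```

The overlap trades a little extra storage for a lower chance of splitting a relevant passage across two chunks.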

3. Embeddings & Embedding Model

Embedding is a technique for representing text data as numerical vectors, which can then be used as input to machine learning models.
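The text-to-vector interface can be sketched as follows. Note that `toy_embed` is a hypothetical stand-in based on the hashing trick, not a learned embedding model: it only shows that every text maps to a vector of the same fixed size, and that similar texts can then be compared numerically (here with cosine similarity). A real RAG system would call an actual embedding model at this step.

```python
import hashlib
import math


def toy_embed(text: str, dim: int = 64) -> list[float]:
    """Map text to a unit-length vector of size `dim` via the hashing trick.

    Illustration only: a real embedding model captures meaning, whereas
    this just counts words in hashed buckets.
    """
    vec = [0.0] * dim
    for token in text.lower().split():
        # Use a stable hash; Python's built-in hash() is salted per process.
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        index = int.from_bytes(digest[:4], "big") % dim
        vec[index] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else vec


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    v1 = toy_embed("vector database")
    v2 = toy_embed("vector database search")
    print(len(v1), cosine_similarity(v1, v2))
```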

4. Vector Databases

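A vector database stores a collection of precomputed text vector representations for fast retrieval and similarity computation. It also provides features such as CRUD operations, metadata filtering, and horizontal scaling.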
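The sketch below mimics a few of these features in memory: create/update, read, delete, and a similarity search with an exact-match metadata filter. The class `InMemoryVectorStore` and its method names are hypothetical and do not correspond to any particular product's API. It uses brute-force cosine search and makes no attempt at horizontal scaling or approximate indexing, which real vector databases provide.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Record:
    vector: list[float]
    text: str
    metadata: dict = field(default_factory=dict)


def _cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class InMemoryVectorStore:
    """Toy vector store: brute-force cosine search over a dict of records."""

    def __init__(self) -> None:
        self._records: dict[str, Record] = {}

    def upsert(self, record_id: str, vector, text: str, metadata=None) -> None:
        # Create or update: the same call covers both cases.
        self._records[record_id] = Record(list(vector), text, dict(metadata or {}))

    def get(self, record_id: str):
        return self._records.get(record_id)

    def delete(self, record_id: str) -> bool:
        return self._records.pop(record_id, None) is not None

    def search(self, query, k: int = 3, where=None) -> list[tuple[str, float]]:
        """Return up to k (id, score) pairs, best first.

        `where` is an optional dict of metadata key/values that must all
        match exactly for a record to be considered.
        """
        results = []
        for record_id, rec in self._records.items():
            if where and any(rec.metadata.get(key) != value for key, value in where.items()):
                continue
            results.append((record_id, _cosine(query, rec.vector)))
        results.sort(key=lambda pair: pair[1], reverse=True)
        return results[:k]


if __name__ == "__main__":
    store = InMemoryVectorStore()
    store.upsert("a", [1.0, 0.0], "about cats", {"lang": "en"})
    store.upsert("b", [0.0, 1.0], "about dogs", {"lang": "en"})
    store.upsert("c", [0.9, 0.1], "sobre gatos", {"lang": "es"})
    print(store.search([1.0, 0.0], k=2))
    print(store.search([1.0, 0.0], k=2, where={"lang": "en"}))
```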

5. User Chat Interface

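A user-friendly chat interface makes it easier to interact with the RAG system: users submit queries through it and receive the generated output.

The input query is converted into an embedding, which is then used to retrieve relevant content from the vector database.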
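The query-to-retrieval step might look roughly like this. The `retrieve` function accepts any embedding callable; the default `bag_of_words` is a hypothetical placeholder (a sparse word-count vector), used only so the sketch runs without an external model. For brevity the chunks are embedded at query time, whereas a real system would look up precomputed vectors in the vector database.

```python
import math
import re
from collections import Counter
from typing import Callable

Vector = dict[str, float]


def bag_of_words(text: str) -> Vector:
    """Placeholder embedding: sparse word counts, punctuation ignored."""
    return dict(Counter(re.findall(r"\w+", text.lower())))


def cosine(a: Vector, b: Vector) -> float:
    dot = sum(weight * b.get(token, 0.0) for token, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(
    query: str,
    chunks: list[str],
    embed: Callable[[str], Vector] = bag_of_words,
    k: int = 2,
) -> list[str]:
    """Embed the query, score every chunk against it, return the top k chunks."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, embed(chunk)), chunk) for chunk in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:k]]


if __name__ == "__main__":
    knowledge = [
        "Chunking splits large text into smaller pieces.",
        "A vector database stores precomputed embeddings.",
        "A prompt template combines the query with retrieved context.",
    ]
    print(retrieve("How does chunking split text?", knowledge, k=1))
```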

6. Prompt Template

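A prompt template is used to produce a prompt suited to the RAG system, typically by combining the user query with the content retrieved from the custom knowledge base. Together they form the input to the LLM, which generates the final response.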
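As a small illustration, a prompt template can be as simple as a format string with slots for the retrieved context and the user's question. The wording of `PROMPT_TEMPLATE` and the helper `build_prompt` are assumptions made for this sketch; the original post does not specify a template, and the resulting string would be passed to whichever LLM the system uses.

```python
PROMPT_TEMPLATE = """Answer the question using only the context below.
If the context is not sufficient, say that you do not know.

Context:
{context}

Question: {question}
Answer:"""


def build_prompt(question: str, retrieved_chunks: list[str]) -> str:
    """Fill the template with numbered retrieved chunks and the user question."""
    context = "\n".join(
        f"[{i}] {chunk}" for i, chunk in enumerate(retrieved_chunks, start=1)
    )
    return PROMPT_TEMPLATE.format(context=context, question=question)


if __name__ == "__main__":
    print(build_prompt(
        "What is RAG?",
        ["RAG adds retrieved knowledge.", "LLMs generate text."],
    ))
```

Numbering the chunks is a design choice that makes it easier to ask the model to cite which passage supports its answer.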

Last modified: January 21, 2024

© Reposting permitted with proper attribution

