cheRAGh (چراغ) is a unified benchmarking suite for Persian Retrieval-Augmented Generation (RAG) systems, covering embedding models, rerankers, retrieval quality, tool calling, and large language model performance across diverse Persian-language datasets from General, Scientific, Education, Legal, and Religious domains.
This report presents the evaluation results of embedding models in the cheRAGh benchmark. Retrieval performance is measured using Recall@5 and MRR.
| # ↕ | Model ↕ | Ctx Len ↕ | Emb Dim ↕ | Params ↕ | Size | Architecture | Education ↕ | General ↕ | Legal ↕ | Religious ↕ | Scientific ↕ | Average ↕ | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MRR ↕ | Recall@5 ↕ | MRR ↕ | Recall@5 ↕ | MRR ↕ | Recall@5 ↕ | MRR ↕ | Recall@5 ↕ | MRR ↕ | Recall@5 ↕ | MRR ↕ | Recall@5 ↕ | |||||||