The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
-
Updated
Jul 25, 2024 - Python
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Codes for NAACL 2021 Paper "Unsupervised Multi-hop Question Answering by Question Generation"
Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809
Pytorch implementation of "A Simple Yet Strong Pipeline for HotpotQA" (Groeneveld, D., Khot, T., & Sabharwal, A.). Now developing!
Enhancing Retrieval-Augmented Generation with Document Link Structure for Multi-hop Web Question Answering
This codebase implements a Retrieval-Augmented Generation (RAG) chatbot using the Gemini API and DSPy framework, designed to answer questions based on the HotPotQA dataset. It includes components for loading data, generating responses, and evaluating model performance through various QA strategies, including basic QA and multi-hop retrieval.
Episodic Distillation for Verified RAG using Google Cloud + MongoDB Atlas
Hallucination-resistant multi-hop QA using hybrid BM25+FAISS retrieval, cross-encoder reranking, citation selection, and NLI-based verification. Evaluated on HotpotQA (7,405 examples), zero-shot.
Implementation of concepts of NLP
LangGraph supervisor with parallel specialist agents. 1.99x speedup vs sequential multi-agent, reproducible in 6s with no API key. Planner, retrievers, analyzers, verifiers, synthesizer.
Agents Don't Always Do What They Think: Measuring Faithfulness in Multi-Step ReAct Agents
Exploration of retrieval methods on the HotpotQA corpus, combining dense retrieval and feature-based reranking. Achieved a mean nDCG@10 of 0.9416 using LambdaRank with features such as cross-encoder score, LLM score, BM25 score, and token-based statistics—surpassing dense retriever + cross-encoder baselines.
RAG system with adaptive retrieval (Qdrant dense + sparse + RRF), cross-encoder re-ranking, and optional long-term memory (Mem0) — evaluated using HotpotQA EM/F1 on 5K questions. Dual-backend: runs locally via Ollama or via API. Dockerized.
Local-only, graph-augmented, iteratively-reasoning, self-corrective RAG showcase on Apple Silicon — pushing toward published SOTA on multi-hop QA (MuSiQue, 2WikiMultiHopQA, HotpotQA) with HippoRAG 2 + Adaptive routing + Search-o1 iterative reasoning + CRAG-style gating + NLI faithfulness verification.
Add a description, image, and links to the hotpotqa topic page so that developers can more easily learn about it.
To associate your repository with the hotpotqa topic, visit your repo's landing page and select "manage topics."