24 datasets found

h
rag-mini-wikipedia
huggingface.co
Updated May 5, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
RAG Datasets (2025). rag-mini-wikipedia [Dataset]. https://huggingface.co/datasets/rag-datasets/rag-mini-wikipedia
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 5, 2025
Dataset authored and provided by
RAG Datasets
License
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Description
In this huggingface discussion you can share what you used the dataset for. Derives from https://www.kaggle.com/datasets/rtatman/questionanswer-dataset?resource=download we generated our own subset using generate.py.
h
finanical-rag-embedding-dataset
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Philipp Schmid, finanical-rag-embedding-dataset [Dataset]. https://huggingface.co/datasets/philschmid/finanical-rag-embedding-dataset
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
Philipp Schmid
Description
philschmid/finanical-rag-embedding-dataset

philschmid/finanical-rag-embedding-dataset is a modified fork of virattt/llama-3-8b-financialQA for fine-tuning embedding models using positive text pairs (question, context). The dataset include 7,000 question, context pairs from NVIDIAs 2023 SEC Filling Report
h
rag-full-20000
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Neural Bridge AI, rag-full-20000 [Dataset]. https://huggingface.co/datasets/neural-bridge/rag-full-20000
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset authored and provided by
Neural Bridge AI
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Retrieval-Augmented Generation (RAG) Full 20000

Retrieval-Augmented Generation (RAG) Full 20000 is an English dataset designed for RAG-optimized models, built by Neural Bridge AI, and released under Apache license 2.0.

Dataset Description Dataset Summary

Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by allowing them to consult an external authoritative knowledge base before generating responses. This approach significantly boosts… See the full description on the dataset page: https://huggingface.co/datasets/neural-bridge/rag-full-20000.
d
Large Language Model (LLM) Data | Machine Learning (ML) Data | AI Training...
datarade.ai
Updated Jan 23, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MealMe (2025). Large Language Model (LLM) Data | Machine Learning (ML) Data | AI Training Data (RAG) for 1M+ Global Grocery, Restaurant, and Retail Stores [Dataset]. https://datarade.ai/data-products/ai-training-data-rag-for-grocery-restaurant-and-retail-ra-mealme
Explore at:
.bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
Dataset updated
Jan 23, 2025
Dataset authored and provided by
MealMe
Area covered
Christmas Island, Trinidad and Tobago, Romania, Norfolk Island, Saint Lucia, Uruguay, Kosovo, Korea (Republic of), Andorra, Iceland
Description
A comprehensive dataset covering over 1 million stores in the US and Canada, designed for training and optimizing retrieval-augmented generation (RAG) models and other AI/ML systems. This dataset includes highly detailed, structured information such as:

Menus: Restaurant menus with item descriptions, categories, and modifiers. Inventory: Grocery and retail product availability, SKUs, and detailed attributes like sizes, flavors, and variations.

Pricing: Real-time and historical pricing data for dynamic pricing strategies and recommendations.

Availability: Real-time stock status and fulfillment details for grocery, restaurant, and retail items.

Applications: Retrieval-Augmented Generation (RAG): Train AI models to retrieve and generate contextually relevant information.

Search Optimization: Build advanced, accurate search and recommendation engines. Personalization: Enable personalized shopping, ordering, and discovery experiences in apps.

Data-Driven Insights: Develop AI systems for pricing analysis, consumer behavior studies, and logistics optimization.

This dataset empowers businesses in marketplaces, grocery apps, delivery services, and retail platforms to scale their AI solutions with precision and reliability.
h
RAG-Instruct
huggingface.co
Updated Jan 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
FreedomAI (2025). RAG-Instruct [Dataset]. https://huggingface.co/datasets/FreedomIntelligence/RAG-Instruct
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 9, 2025
Dataset authored and provided by
FreedomAI
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Introduction

RAG-Instruct is a RAG dataset designed to comprehensively enhance LLM RAG capabilities, synthesized using GPT-4o. This dataset is based on the Wikipedia corpus and This dataset is based on the Wikipedia corpus and offers the advantages of query-document scenario diversity and task diversity. The RAG-Instruct dataset can significantly enhance the RAG ability of LLMs and make remarkable improvements in RAG performance across various tasks.

Model WQA (acc) PQA (acc)… See the full description on the dataset page: https://huggingface.co/datasets/FreedomIntelligence/RAG-Instruct.
h
sds-news-rag
huggingface.co
Updated Apr 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jiarui Liu (2025). sds-news-rag [Dataset]. https://huggingface.co/datasets/Jerry999/sds-news-rag
Explore at:
Dataset updated
Apr 16, 2025
Authors
Jiarui Liu
Description
This dataset is derived from the Global News Dataset. Please refer to the original source (also cited below) and ensure that your use complies with its terms and conditions.

Webz.io News Dataset Repository Introduction

Welcome to the Webz.io News Dataset Repository! This repository is created by Webz.io and is dedicated to providing free datasets of publicly available news articles. We release new datasets weekly, each containing around 1,000 news articles focused on… See the full description on the dataset page: https://huggingface.co/datasets/Jerry999/sds-news-rag.
h
MultiHopRAG
huggingface.co
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
yixuan, MultiHopRAG [Dataset]. https://huggingface.co/datasets/yixuantt/MultiHopRAG
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Authors
yixuan
License
https://choosealicense.com/licenses/odc-by/https://choosealicense.com/licenses/odc-by/
Description
Dataset Card for Dataset Name

A Dataset for Evaluating Retrieval-Augmented Generation Across Documents

Dataset Description

MultiHop-RAG: a QA dataset to evaluate retrieval and reasoning across documents with metadata in the RAG pipelines. It contains 2556 queries, with evidence for each query distributed across 2 to 4 documents. The queries also involve document metadata, reflecting complex scenarios commonly found in real-world RAG applications.

Dataset Sources… See the full description on the dataset page: https://huggingface.co/datasets/yixuantt/MultiHopRAG.
k
RAG Stock: A Risky Investment (Forecast)
kappasignal.com
Updated Jul 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
KappaSignal (2023). RAG Stock: A Risky Investment (Forecast) [Dataset]. https://www.kappasignal.com/2023/07/rag-stock-risky-investment.html
Explore at:
Dataset updated
Jul 30, 2023
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

RAG Stock: A Risky Investment

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data
P
SQuAD Dataset
paperswithcode.com
Updated Oct 5, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
François Bienvenu; Mike Steel (2022). SQuAD Dataset [Dataset]. https://paperswithcode.com/dataset/squad
Explore at:
Dataset updated
Oct 5, 2022
Authors
François Bienvenu; Mike Steel
Description
The Stanford Question Answering Dataset (SQuAD) is a collection of question-answer pairs derived from Wikipedia articles. In SQuAD, the correct answers of questions can be any sequence of tokens in the given text. Because the questions and answers are produced by humans through crowdsourcing, it is more diverse than some other question-answering datasets. SQuAD 1.1 contains 107,785 question-answer pairs on 536 articles. SQuAD2.0 (open-domain SQuAD, SQuAD-Open), the latest version, combines the 100,000 questions in SQuAD1.1 with over 50,000 un-answerable questions written adversarially by crowdworkers in forms that are similar to the answerable ones.
h
pubmed
huggingface.co
Updated Feb 26, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MedRAG (2024). pubmed [Dataset]. https://huggingface.co/datasets/MedRAG/pubmed
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Feb 26, 2024
Authors
MedRAG
Description
The PubMed Corpus in MedRAG

This HF dataset contains the snippets from the PubMed corpus used in MedRAG. It can be used for medical Retrieval-Augmented Generation (RAG).

News

(02/26/2024) The "id" column has been reformatted. A new "PMID" column is added.

Dataset Details Dataset Descriptions

PubMed is the most widely used literature resource, containing over 36 million biomedical articles. For MedRAG, we use a PubMed subset of 23.9 million… See the full description on the dataset page: https://huggingface.co/datasets/MedRAG/pubmed.
h
RAG_diet
huggingface.co
Updated Jun 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Navaneeth K (2025). RAG_diet [Dataset]. https://huggingface.co/datasets/navaneeth005/RAG_diet
Explore at:
Dataset updated
Jun 22, 2025
Authors
Navaneeth K
Description
Dataset Card for Dataset Name

RAG FOR DIETS

Dataset Details Dataset Description

THIS IS A DATASET CREATED BY SLECTIVELY CHOOSING AND MERGING MULTIPLE DATASETS FROM VARIOUS SOURCERS INCLUDING OTHER DATASETS AND GENERATED DATASETS. FEEL FREE TO USE THESE ANYWHERE AND MAKE SURE TO CREDIT THE APPROPIATE DATA SOURCERS WHEREVER NECESSARY!! 😀

Curated by: [Navaneeth. K]
k
RAG Stock: The Next Bubble? (Forecast)
kappasignal.com
Updated Aug 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
KappaSignal (2023). RAG Stock: The Next Bubble? (Forecast) [Dataset]. https://www.kappasignal.com/2023/08/rag-stock-next-bubble.html
Explore at:
Dataset updated
Aug 13, 2023
Dataset authored and provided by
KappaSignal
License
https://www.kappasignal.com/p/legal-disclaimer.htmlhttps://www.kappasignal.com/p/legal-disclaimer.html
Description
This analysis presents a rigorous exploration of financial data, incorporating a diverse range of statistical features. By providing a robust foundation, it facilitates advanced research and innovative modeling techniques within the field of finance.

RAG Stock: The Next Bubble?

Financial data:

Historical daily stock prices (open, high, low, close, volume)

Fundamental data (e.g., market capitalization, price to earnings P/E ratio, dividend yield, earnings per share EPS, price to earnings growth, debt-to-equity ratio, price-to-book ratio, current ratio, free cash flow, projected earnings growth, return on equity, dividend payout ratio, price to sales ratio, credit rating)

Technical indicators (e.g., moving averages, RSI, MACD, average directional index, aroon oscillator, stochastic oscillator, on-balance volume, accumulation/distribution A/D line, parabolic SAR indicator, bollinger bands indicators, fibonacci, williams percent range, commodity channel index)

Machine learning features:

Feature engineering based on financial data and technical indicators

Sentiment analysis data from social media and news articles

Macroeconomic data (e.g., GDP, unemployment rate, interest rates, consumer spending, building permits, consumer confidence, inflation, producer price index, money supply, home sales, retail sales, bond yields)

Potential Applications:

Stock price prediction

Portfolio optimization

Algorithmic trading

Market sentiment analysis

Risk management

Use Cases:

Researchers investigating the effectiveness of machine learning in stock market prediction

Analysts developing quantitative trading Buy/Sell strategies

Individuals interested in building their own stock market prediction models

Students learning about machine learning and financial applications

Additional Notes:

The dataset may include different levels of granularity (e.g., daily, hourly)

Data cleaning and preprocessing are essential before model training

Regular updates are recommended to maintain the accuracy and relevance of the data

happy_matrix_eco_data

kaggle.com

Updated May 5, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Olga Seymour (2025). happy_matrix_eco_data [Dataset]. https://www.kaggle.com/datasets/matrixbpm/eng-change-orders

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

May 5, 2025

Dataset provided by

Kagglehttp://kaggle.com/

Authors

Olga Seymour

License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

This dataset contains four synthetic Engineering Change Order (ECO) documents, authored by me as part of the Google GenAI Capstone Challenge (Q1 2025).

These documents simulate realistic engineering change processes at a fictional wearable technology company named HappyMatrix, which is developing a conceptual product called MatrixSync X100.

Each .txt file captures a different engineering change—ranging from hardware updates and battery improvements to algorithm tuning and sustainable packaging—written in technical yet human-readable language.

📄 File Overview

File Name	Description
`ECO-100001.txt`	Enclosure update — added ventilation slots to improve thermal performance
`ECO-100002.txt`	Battery replacement — switch from lithium-polymer to solid-state for safety and longevity
`ECO-100003.txt`	Algorithm tuning — improved step detection accuracy in signal processing logic
`ECO-100004.txt`	Packaging redesign — introduced eco-friendly materials and minimized waste

📌 Key Highlights

Author: Olga Seymour
Company & Product: Fully fictional (HappyMatrix & MatrixSync X100)
Purpose: Educational use in a Generative AI Capstone
Use Case: Demonstrates document Q&A, RAG, embeddings, and structured output
Document Type: Unstructured .txt files resembling real ECOs

🎯 Intended Use

This dataset supports experimentation and learning in areas such as:

Retrieval-Augmented Generation (RAG)
Semantic vector search with embeddings
Few-shot prompting + structured output (JSON)
LLM-based document understanding and QA

Ideal for projects simulating GenAI applications in product lifecycle management, documentation review, and engineering operations.

🔒 Educational Use Only

These documents were authored entirely by me to support my GenAI Capstone notebook.
They do not represent any real company or proprietary information.
Any resemblance to existing products or organizations is purely coincidental.

📘 Companion Notebook

This dataset is used in the following notebook:
🧠 HappyMatrix ECO Assistant
A GenAI-powered tool for analyzing engineering change orders with LangChain, Gemini, and ChromaDB.

📄 License

This dataset is shared under the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.

You are free to: - Share — copy and redistribute the material - Adapt — remix, transform, and build upon it

Under the following terms: - Attribution — You must give appropriate credit. - NonCommercial — You may not use the material for commercial purposes.

These ECO documents were created for educational demonstration purposes as part of the Google GenAI Capstone 2025.

h
RAG_recovery
huggingface.co
Updated Jun 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Navaneeth K (2025). RAG_recovery [Dataset]. https://huggingface.co/datasets/navaneeth005/RAG_recovery
Explore at:
Dataset updated
Jun 22, 2025
Authors
Navaneeth K
Description
Dataset Card for Dataset Name

RAG FOR RECOVERY

Dataset Details Dataset Description

THIS IS A DATASET CREATED BY SLECTIVELY CHOOSING AND MERGING MULTIPLE DATASETS FROM VARIOUS SOURCERS INCLUDING OTHER DATASETS AND GENERATED DATASETS. FEEL FREE TO USE THESE ANYWHERE AND MAKE SURE TO CREDIT THE APPROPIATE DATA SOURCERS WHEREVER NECESSARY!! 😀

Curated by: [Navaneeth. K]
h
RAG_workout_db
huggingface.co
Updated Jun 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Navaneeth K (2025). RAG_workout_db [Dataset]. https://huggingface.co/datasets/navaneeth005/RAG_workout_db
Explore at:
Dataset updated
Jun 22, 2025
Authors
Navaneeth K
Description
Dataset Card for Dataset Name

RAG FOR DIFFERENT WORKOUTS

Dataset Details Dataset Description

THIS IS A DATASET CREATED BY SLECTIVELY CHOOSING AND MERGING MULTIPLE DATASETS FROM VARIOUS SOURCERS INCLUDING OTHER DATASETS AND GENERATED DATASETS. FEEL FREE TO USE THESE ANYWHERE AND MAKE SURE TO CREDIT THE APPROPIATE DATA SOURCERS WHEREVER NECESSARY!! 😀

Curated by: [Navaneeth. K]
P
WFDD Dataset
paperswithcode.com
Updated Jul 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Qiyu Chen; Huiyuan Luo; Chengkan Lv; Zhengtao Zhang (2024). WFDD Dataset [Dataset]. https://paperswithcode.com/dataset/wfdd
Explore at:
Dataset updated
Jul 11, 2024
Authors
Qiyu Chen; Huiyuan Luo; Chengkan Lv; Zhengtao Zhang
Description
WFDD is a dataset for benchmarking anomaly detection methods with a focus on textile inspection. It includes 4101 woven fabric images categorized into 4 categories: grey cloth, grid cloth, yellow cloth, and pink flower. The first three classes are collected from the industrial production sites of WEIQIAO Textile, while the 'pink flower' class is gathered from the publicly available Cloth Flaw Dataset. Each category contains block-shape, point-like, and line-type defects with pixel-level annotations.
h
RAG_meals
huggingface.co
Updated Jun 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Navaneeth K (2025). RAG_meals [Dataset]. https://huggingface.co/datasets/navaneeth005/RAG_meals
Explore at:
Dataset updated
Jun 22, 2025
Authors
Navaneeth K
Description
Dataset Card for Dataset Name

RAG FOR MEALS

Dataset Details Dataset Description

THIS IS A DATASET CREATED BY SLECTIVELY CHOOSING AND MERGING MULTIPLE DATASETS FROM VARIOUS SOURCERS INCLUDING OTHER DATASETS AND GENERATED DATASETS. FEEL FREE TO USE THESE ANYWHERE AND MAKE SURE TO CREDIT THE APPROPIATE DATA SOURCERS WHEREVER NECESSARY!! 😀

Curated by: [Navaneeth. K]
h
RAG_supplements
huggingface.co
Updated Jun 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Navaneeth K (2025). RAG_supplements [Dataset]. https://huggingface.co/datasets/navaneeth005/RAG_supplements
Explore at:
Dataset updated
Jun 22, 2025
Authors
Navaneeth K
Description
Dataset Card for Dataset Name

RAG FOR SUPPLEMENTS

Dataset Details Dataset Description

THIS IS A DATASET CREATED BY SLECTIVELY CHOOSING AND MERGING MULTIPLE DATASETS FROM VARIOUS SOURCERS INCLUDING OTHER DATASETS AND GENERATED DATASETS. FEEL FREE TO USE THESE ANYWHERE AND MAKE SURE TO CREDIT THE APPROPIATE DATA SOURCERS WHEREVER NECESSARY!! 😀

Curated by: [Navaneeth. K]
h
RAG_strength_advice
huggingface.co
Updated Jun 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Navaneeth K (2025). RAG_strength_advice [Dataset]. https://huggingface.co/datasets/navaneeth005/RAG_strength_advice
Explore at:
Dataset updated
Jun 22, 2025
Authors
Navaneeth K
Description
Dataset Card for Dataset Name

RAG FOR STRENGTH WORKOUTS

Dataset Details Dataset Description

THIS IS A DATASET CREATED BY SLECTIVELY CHOOSING AND MERGING MULTIPLE DATASETS FROM VARIOUS SOURCERS INCLUDING OTHER DATASETS AND GENERATED DATASETS. FEEL FREE TO USE THESE ANYWHERE AND MAKE SURE TO CREDIT THE APPROPIATE DATA SOURCERS WHEREVER NECESSARY!! 😀

Curated by: [Navaneeth. K]
h
RAG_cardio
huggingface.co
Updated Jun 22, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Navaneeth K (2025). RAG_cardio [Dataset]. https://huggingface.co/datasets/navaneeth005/RAG_cardio
Explore at:
Dataset updated
Jun 22, 2025
Authors
Navaneeth K
Description
Dataset Card for Dataset Name

RAG FOR CARDIO WORKOUTS

Dataset Details Dataset Description

THIS IS A DATASET CREATED BY SLECTIVELY CHOOSING AND MERGING MULTIPLE DATASETS FROM VARIOUS SOURCERS INCLUDING OTHER DATASETS AND GENERATED DATASETS. FEEL FREE TO USE THESE ANYWHERE AND MAKE SURE TO CREDIT THE APPROPIATE DATA SOURCERS WHEREVER NECESSARY!! 😀

Curated by: [Navaneeth. K]

Facebook

Twitter

Click to copy link

Link copied

Cite

RAG Datasets (2025). rag-mini-wikipedia [Dataset]. https://huggingface.co/datasets/rag-datasets/rag-mini-wikipedia

rag-mini-wikipedia

rag-datasets/rag-mini-wikipedia

Explore at:

CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.

Dataset updated

May 5, 2025

Dataset authored and provided by

RAG Datasets

License

Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically

Description

In this huggingface discussion you can share what you used the dataset for. Derives from https://www.kaggle.com/datasets/rtatman/questionanswer-dataset?resource=download we generated our own subset using generate.py.

Clear search

Close search

Google apps

Main menu

rag-mini-wikipedia

finanical-rag-embedding-dataset

rag-full-20000

Large Language Model (LLM) Data | Machine Learning (ML) Data | AI Training...

RAG-Instruct

sds-news-rag

MultiHopRAG

RAG Stock: A Risky Investment (Forecast)

RAG Stock: A Risky Investment

Financial data:

Machine learning features:

Potential Applications:

Use Cases:

Additional Notes:

SQuAD Dataset

pubmed

RAG_diet

RAG Stock: The Next Bubble? (Forecast)

RAG Stock: The Next Bubble?

Financial data:

Machine learning features:

Potential Applications:

Use Cases:

Additional Notes:

happy_matrix_eco_data

📄 File Overview

📌 Key Highlights

🎯 Intended Use

🔒 Educational Use Only

📘 Companion Notebook

📄 License

RAG_recovery

RAG_workout_db

WFDD Dataset

RAG_meals

RAG_supplements

RAG_strength_advice

RAG_cardio

rag-mini-wikipedia

rag-datasets/rag-mini-wikipedia