pythia-70m-epochs-0-39-p3-O
This dataset contains reward model analysis results for IRL training.
Dataset Information
Base Model ID: ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-70m
Full Model ID: ajagota71/toxicity-reward-model-v-head-output-max-margin-seed-42-pythia-70m
Epoch: 0
Analysis Timestamp: 2025-08-03T13:03:24.996419
Number of Samples: 18000
Columns
sample_index: Index of the sample
prompt: Input prompt (if available)…
See the full description on the dataset page: https://huggingface.co/datasets/ajagota71/pythia-70m-epochs-0-39-p3-O.
Mechanistic-Anomaly-Detection/pythia-70m-deduped-memorized dataset hosted on Hugging Face and contributed by the HF Datasets community
Dataset Card for "pythia-70m-rs"
More Information needed
tim-lawson/mlsae-pythia-70m-deduped-x256-k32-examples dataset hosted on Hugging Face and contributed by the HF Datasets community
tommyp111/pythia-70m-layer-4-pile-resid-post-activations dataset hosted on Hugging Face and contributed by the HF Datasets community
pythia-70m-p3-gt
This dataset contains ground truth classification results for model evaluation.
Dataset Information
Model ID: s-nlp/roberta_toxicity_classifier
Model Type: sequence_classification
Analysis Timestamp: 2025-08-03T18:49:08.677506
Number of Samples: 2000
Columns
sample_index: Index of the sample
prompt: Input prompt (if available)
original_output: Original model output
detoxified_output: Detoxified model output
prompt_score: Classification score…
See the full description on the dataset page: https://huggingface.co/datasets/ajagota71/pythia-70m-p3-gt.
paacamo/nvidia-faq-EleutherAI-pythia-70m-fine-tuned dataset hosted on Hugging Face and contributed by the HF Datasets community
angerami/pythia-70m-deduped_weight_evolution_001 dataset hosted on Hugging Face and contributed by the HF Datasets community
tim-lawson/mlsae-lens-std-pythia-70m-deduped-x64-k32-dists dataset hosted on Hugging Face and contributed by the HF Datasets community
tim-lawson/mlsae-pythia-70m-deduped-x64-k128-examples dataset hosted on Hugging Face and contributed by the HF Datasets community
tim-lawson/wikitext-2-v1-pythia-70m-loss-16-gram dataset hosted on Hugging Face and contributed by the HF Datasets community
tim-lawson/wikitext-2-v1-pythia-70m-loss-4-gram dataset hosted on Hugging Face and contributed by the HF Datasets community
tim-lawson/sae-pythia-70m-deduped-x64-k32-tfm-layers-2-dists dataset hosted on Hugging Face and contributed by the HF Datasets community
tommyp111/gg-bridge-sharegpt-tokenized-pythia-70m-deduped dataset hosted on Hugging Face and contributed by the HF Datasets community
Dataset Card for HuggingFaceH4/rs_test
SFT model: HuggingFaceH4/falcon-40b-ift-v3.1
Reward model: HuggingFaceH4/pythia-70m-rm-v0.0
Temperature: 0.7