100+ datasets found

h
Civ-Models
huggingface.co
Updated May 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
pm-paper-datasets (2025). Civ-Models [Dataset]. https://huggingface.co/datasets/pm-paper-datasets/Civ-Models
Explore at:
Dataset updated
May 1, 2025
Authors
pm-paper-datasets
License
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Description
Dataset of models and their metadata obtained from CivitAI

This dataset is licensed under CC BY-NC 4.0, which allows for non-commercial use with proper attribution.

Column Preview Model Data Preview (Version ID columns summarized)

Column Name Description Example Value

id Unique identifier for the model on CivitAI 4201

name Name of the model Realistic Vision V6.0 B1

type Type of model (e.g., Checkpoint, LoRA, etc.) Checkpoint

baseModel Base… See the full description on the dataset page: https://huggingface.co/datasets/pm-paper-datasets/Civ-Models.
d
Neuromuscular Models Library
dknet.org
neuinfo.org
+2more
Updated Jan 29, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2022). Neuromuscular Models Library [Dataset]. http://identifiers.org/RRID:SCR_002682
Explore at:
Unique identifier
https://identifiers.org/RRID:SCR_002682
Dataset updated
Jan 29, 2022
Description
The goal of the neuromuscular models library is to provide a resource for students, researchers, and clinicians to access, use, test, and develop models. The majority of models in this library are for use with OpenSIM and/or SIMM. Users who contribute models to the database can set up a project page where they can track who is using the model and contact with them.
R
Iha Models Dataset
universe.roboflow.com
zip
Updated Apr 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
IHA (2025). Iha Models Dataset [Dataset]. https://universe.roboflow.com/iha-e4r7q/iha-models
Explore at:
zipAvailable download formats
Dataset updated
Apr 11, 2025
Dataset authored and provided by
IHA
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Iha Bounding Boxes
Description
IHA Models

## Overview IHA Models is a dataset for object detection tasks - it contains Iha annotations for 481 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
d
Enhancing Microsimulation Models for Improved Work Zone Planning:...
catalog.data.gov
data.virginia.gov
+1more
Updated Jul 19, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
US Department of Transportation (2024). Enhancing Microsimulation Models for Improved Work Zone Planning: Car-Following Data from Western Massachusetts (Runs) [Dataset]. https://catalog.data.gov/dataset/enhancing-microsimulation-models-for-improved-work-zone-planning-car-following-data-from-w-f28f8
Explore at:
Dataset updated
Jul 19, 2024
Dataset provided by
US Department of Transportation
Description
The data describe freeway car-following behavior (such as velocity, acceleration, and relative position) for the car-following instances observed during 6 data collection runs, collected using an Instrumented Research Vehicle (IRV) along freeways and arterials in western Massachusetts in the summer of 2016 to better understand work zone driver behaviors. The USDOT Volpe National Transportation Systems Center (Volpe Center) identified, isolated, and classified individual car following instances from within the raw datasets (classification parameters included roadway type, level of congestion, and speed limit), then processed, refined, and cleaned the dataset. This table contains metadata about each data collection run. See also the instances table (https://datahub.transportation.gov/Automobiles/Enhancing-Microsimulation-Models-for-Improved-Work/74ug-57tr) and radar table (https://datahub.transportation.gov/Automobiles/Enhancing-Microsimulation-Models-for-Improved-Work/4qbx-egtn).
h
huggingface-models-raw
huggingface.co
Updated Mar 2, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Fazil (2022). huggingface-models-raw [Dataset]. https://huggingface.co/datasets/ftopal/huggingface-models-raw
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Mar 2, 2022
Authors
Fazil
Description
ftopal/huggingface-models-raw dataset hosted on Hugging Face and contributed by the HF Datasets community
Determining the Predictive Limit of QSAR Models
catalog.data.gov
data.amerigeoss.org
Updated Dec 13, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. EPA Office of Research and Development (ORD) (2021). Determining the Predictive Limit of QSAR Models [Dataset]. https://catalog.data.gov/dataset/determining-the-predictive-limit-of-qsar-models
Explore at:
Dataset updated
Dec 13, 2021
Dataset provided by
United States Environmental Protection Agencyhttp://www.epa.gov/
Description
The research done to evaluate how the predictivity of models are effected by error in either the training or the test set is simple to describe conceptually. Benchmark datasets are downloaded from reputable sources. Then the datasets are split into training and test sets. Randomized error is added and then models created on both error laden and native training sets. Those models are used to predict both error laden and native test sets. Differences in standard statistics commonly used to assess predictivity are observed. This dataset is associated with the following publication: Kolmar, S., and C. Grulke. The Effect of Noise on the Predictive Limit of QSAR Models. Journal of Cheminformatics. Springer, New York, NY, USA, 13: 92, (2021).
Model America: Data and Models for every U.S. Building
osti.gov
search.dataone.org
Updated Apr 14, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Environmental System Science Data Infrastructure for a Virtual Ecosystem (ESS-DIVE) (United States) (2021). Model America: Data and Models for every U.S. Building [Dataset]. http://doi.org/10.15485/2283980
Explore at:
Unique identifier
https://doi.org/10.15485/2283980
Dataset updated
Apr 14, 2021
Dataset provided by
Office of Sciencehttp://www.er.doe.gov/
United States Department of Energyhttp://energy.gov/
Southwest Urban Corridor Integrated Field Laboratory (SW-IFL)
Environmental System Science Data Infrastructure for a Virtual Ecosystem (ESS-DIVE) (United States)
Area covered
United States
Description
The 5-year goal of the “Model America” concept was to generate a model of every building in the United States. This data repository delivers on that goal with "Model America v1".Oak Ridge National Laboratory (ORNL) has developed the Automatic Building Energy Modeling (AutoBEM) software suite to process multiple types of data, extract building-specific descriptors, generate building energy models, and simulate them on High Performance Computing (HPC) resources. For more information, see AutoBEM-related publications (bit.ly/AutoBEM).There were 125,715,609 buildings detected in the United States. Of this number, 122,146,671 (97.2%) buildings resulted in a successful generation and simulation of a building energy model. This dataset includes the full 125 million buildings. Future updates may include additional buildings, data improvements, or other algorithmic model enhancements in "Model America v2".This dataset contains OSM and IDF zip files for every U.S. county. Each zip file contains the generated buildings from that county.The .csv input data contains the following data fields:1. ID - unique building ID2. Centroid - building center location in latitude/longitude (from Footprint2D)3. Footprint2D - building polygon of 2D footprint (lat1/lon1_lat2/lon2_...)4. State_abbr - state name5. Area - estimate of total conditioned floor area (ft2)6. Area2D - footprint area (ft2)7. Height - building height (ft)8. NumFloors - number of floors (above-grade)9. WWR_surfaces - percent of each facade (pair of points from Footprint2D) covered by fenestration/windows (average 14.5% for residential, 40% for commercial buildings)10. CZ - ASHRAE Climate Zone designation11. BuildingType - DOE prototype building designation (IECC=residential) as implemented by OpenStudio-standards12. Standard - building vintageThis data is made free and openly available in hopes of stimulating any simulation-informed use case. Data is provided as-is with no warranties, express or implied, regarding fitness for a particular purpose. We wish to thank our sponsors which include Oak Ridge National Laboratory (ORNL) Laboratory Directed Research and Development (LDRD), U.S. Dept. of Energy’s (DOE) Building Technologies Office (BTO), Office of Electricity (OE), Biological and Environmental Research (BER), and National Nuclear Security Administration (NNSA).
R
Korek Api Models Dataset
universe.roboflow.com
zip
Updated Jun 9, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
HALFIAH (2024). Korek Api Models Dataset [Dataset]. https://universe.roboflow.com/halfiah/korek-api-models
Explore at:
zipAvailable download formats
Dataset updated
Jun 9, 2024
Dataset authored and provided by
HALFIAH
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Variables measured
Korek Api Bounding Boxes
Description
Korek Api Models

## Overview Korek Api Models is a dataset for object detection tasks - it contains Korek Api annotations for 200 images. ## Getting Started You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model. ## License This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
NASA 3D Models: Dawn
catalog.data.gov
data.nasa.gov
+3more
Updated Apr 11, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Aeronautics and Space Administration (2025). NASA 3D Models: Dawn [Dataset]. https://catalog.data.gov/dataset/nasa-3d-models-dawn
Explore at:
Dataset updated
Apr 11, 2025
Dataset provided by
NASAhttp://nasa.gov/
Description
This .stl file was produced by scaling the original model and converting it directly to .stl format.
STI Tagging Models
catalog.data.gov
data.nasa.gov
+1more
Updated Apr 10, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
NASA (2025). STI Tagging Models [Dataset]. https://catalog.data.gov/dataset/sti-tagging-models
Explore at:
Dataset updated
Apr 10, 2025
Dataset provided by
NASAhttp://nasa.gov/
Description
Keyword models for a subset of the NASA Thesaurus (https://www.sti.nasa.gov/nasa-thesaurus/). These models were trained on the NASA Technical Reports Server (NTRS). These models can be used with the concept-tagging-api flask server (https://github.com/nasa/concept-tagging-api) to run a keyword predicting service.
c
Data from: SegSub: Evaluating Robustness to Knowledge Conflicts and...
kilthub.cmu.edu
json
Updated May 14, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Peter Carragher (2025). SegSub: Evaluating Robustness to Knowledge Conflicts and Hallucinations in Vision-Language Models [Dataset]. http://doi.org/10.1184/R1/28297076.v3
Explore at:
jsonAvailable download formats
Unique identifier
https://doi.org/10.1184/R1/28297076.v3
Dataset updated
May 14, 2025
Dataset provided by
Carnegie Mellon University
Authors
Peter Carragher
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This research introduces \segsub, a framework for applying targeted image perturbations to investigate VLM resilience against knowledge conflicts. Our analysis reveals distinct vulnerability patterns: while VLMs are robust to parametric conflicts (20% adherence rates), they exhibit significant weaknesses in identifying counterfactual conditions (
n
Data from: Bayesian Inference for Growth Mixture Models with an Unknown...
curate.nd.edu
Updated Nov 11, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Meng Qiu (2024). Bayesian Inference for Growth Mixture Models with an Unknown Number of Classes [Dataset]. http://doi.org/10.7274/26761573.v1
Explore at:
Unique identifier
https://doi.org/10.7274/26761573.v1
Dataset updated
Nov 11, 2024
Dataset provided by
University of Notre Dame
Authors
Meng Qiu
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Growth mixture models (GMMs) have been widely used to capture different growth trajectories of unobserved subpopulations (or latent classes). The traditional GMM determines the optimal number of classes through a process called class enumeration, which involves fitting a sequence of models with an increasing number of classes and then selecting the best-fitting model using statistical criteria. Despite its popularity, class enumeration has long been criticized for introducing severe subjectivity when comparing the fitted models.

Bayesian nonparametric (BNP) mixture modeling offers an alternative approach to detecting latent classes. The BNP approach circumvents the subjectivity inherent in class enumeration by placing a prior on the mixing distribution, which indirectly induces a prior on the number of classes. Consequently, the number of classes can be inferred directly from the data. However, the BNP approach remains understudied in the context of GMM. To reduce this research gap, the dissertation aims to: 1) propose two BNP-GMMs using the Dirichlet process mixture and the mixture of finite mixtures models; 2) compare the performance of the two proposed models in determining the number of classes $K$ with that of the traditional GMM; and 3) evaluate the performance of the two proposed models in choosing K when using the mode versus when using a loss function called variation of information (VI).

Based on Monte Carlo simulations, Study 1 compares the proposed models and the traditional GMM in choosing K when there is no model misspecification, while Study 2 compares them in choosing K when there is model misspecification in the latent mean structure. Overall, simulation results showed that: 1) the proposed models using VI were more accurate than using the mode; 2) when the population was homogeneous (comprising only one class), the proposed models using VI yielded the highest accuracy in choosing K; whereas, when the population was heterogeneous (consisting of three classes), the proposed models using VI achieved superior accuracy in choosing K when class separation was large; and 3) the proposed models using VI demonstrated robustness against exacerbated overfitting caused by model misspecification. For illustration, the proposed BNP-GMMs were applied to data from the Early Childhood Longitudinal Study, Kindergarten Class of 1998-99.
NASA 3D Models: TDRS
catalog.data.gov
data.staging.idas-ds1.appdat.jsc.nasa.gov
+1more
Updated Apr 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
National Aeronautics and Space Administration (2025). NASA 3D Models: TDRS [Dataset]. https://catalog.data.gov/dataset/nasa-3d-models-tdrs-e0fa2
Explore at:
Dataset updated
Apr 11, 2025
Dataset provided by
NASAhttp://nasa.gov/
Description
Polygons: 51 Vertices: 32
a
Coding Index by Models Model
artificialanalysis.ai
Updated May 15, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artificial Analysis (2025). Coding Index by Models Model [Dataset]. https://artificialanalysis.ai/models
Explore at:
Dataset updated
May 15, 2025
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Represents the average of coding benchmarks in the Artificial Analysis Intelligence Index (LiveCodeBench & SciCode) by Model
Number of notable machine learning models globally in 2023, by geographic...
statista.com
Updated Jun 24, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Number of notable machine learning models globally in 2023, by geographic area [Dataset]. https://www.statista.com/statistics/1465312/notable-machine-learning-models-worldwide/
Explore at:
Dataset updated
Jun 24, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2023
Area covered
Worldwide
Description
The United States is by far the largest producer of notable machine learning programs in 2024, with **, ahead of China's **. It is notable that France, Germany, and UK, despite accounting for smaller economic size and population, now outproduce China on machine learning programs together, producing some ** models versus China's **.
a
Pricing: Image Input Pricing by Models Model
artificialanalysis.ai
Updated May 15, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artificial Analysis (2025). Pricing: Image Input Pricing by Models Model [Dataset]. https://artificialanalysis.ai/models
Explore at:
Dataset updated
May 15, 2025
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Image Input Price: USD per 1k images at 1MP (1024x1024) by Model
h
whisper-models
huggingface.co
Updated Dec 3, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Promila Ghosh (2024). whisper-models [Dataset]. https://huggingface.co/datasets/pr0mila/whisper-models
Explore at:
Dataset updated
Dec 3, 2024
Authors
Promila Ghosh
Description
pr0mila/whisper-models dataset hosted on Hugging Face and contributed by the HF Datasets community
Black-box models for TERP interpretation
figshare.com
zip
Updated Apr 2, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shams Mehdi (2024). Black-box models for TERP interpretation [Dataset]. http://doi.org/10.6084/m9.figshare.24475003.v2
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.24475003.v2
Dataset updated
Apr 2, 2024
Dataset provided by
Figsharehttp://figshare.com/
Authors
Shams Mehdi
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
TERP is a post-hoc interpretation scheme for explaining black-box AI predictions. TERP works by constructing a linear, local interpretable model that approximates the black-box in the vicinity of the instance being explained. TERP determines the accuracy-interpretability trade-off by introducing and using the concept of interpretation entropy.This data repository contains the three trained machine learning models: VAMPnets, Vision Transformers models (ViT) - pre-trained (model.ckpt)+ fine-tuned (best-model.ckpt) + fine-tuned_data_randomized (bad-model.ckpt), attention-based bi-directional LSTM) trained on molecular dynamics simulation trajectory of alanine dipeptide, facial attributes of celebrities (CelebA), and Antonio Gulli’s (AG’s) news corpus respectively. The simulated trajectory (dihedral angles) for the molecular dynamics simulation is also provided.
D
Notable AI Models
epoch.ai
csv
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Epoch AI, Notable AI Models [Dataset]. https://epoch.ai/data/notable-ai-models
Explore at:
csvAvailable download formats
Dataset authored and provided by
Epoch AI
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Global
Variables measured
https://epoch.ai/data/notable-ai-models-documentation#records
Measurement technique
https://epoch.ai/data/notable-ai-models-documentation#records
Description
Our most comprehensive database of AI models, containing over 800 models that are state of the art, highly cited, or otherwise historically notable. It tracks key factors driving machine learning progress and includes over 300 training compute estimates.
a
Math Index by Models Model
artificialanalysis.ai
Updated May 15, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Artificial Analysis (2025). Math Index by Models Model [Dataset]. https://artificialanalysis.ai/models
Explore at:
Dataset updated
May 15, 2025
Dataset authored and provided by
Artificial Analysis
Description
Comparison of Represents the average of math benchmarks in the Artificial Analysis Intelligence Index (AIME 2024 & Math-500) by Model

Facebook

Twitter

Click to copy link

Link copied

Cite

pm-paper-datasets (2025). Civ-Models [Dataset]. https://huggingface.co/datasets/pm-paper-datasets/Civ-Models

Civ-Models

pm-paper-datasets/Civ-Models

Explore at:

33 scholarly articles cite this dataset (View in Google Scholar)

Dataset updated

May 1, 2025

Authors

pm-paper-datasets

License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Dataset of models and their metadata obtained from CivitAI

This dataset is licensed under CC BY-NC 4.0, which allows for non-commercial use with proper attribution.

  Column Preview





  Model Data Preview (Version ID columns summarized)

Column Name Description Example Value

id Unique identifier for the model on CivitAI 4201

name Name of the model Realistic Vision V6.0 B1

type Type of model (e.g., Checkpoint, LoRA, etc.) Checkpoint

baseModel Base… See the full description on the dataset page: https://huggingface.co/datasets/pm-paper-datasets/Civ-Models.

Clear search

Close search

Google apps

Main menu

Civ-Models

Neuromuscular Models Library

Iha Models Dataset

IHA Models

Enhancing Microsimulation Models for Improved Work Zone Planning:...

huggingface-models-raw

Determining the Predictive Limit of QSAR Models

Model America: Data and Models for every U.S. Building

Korek Api Models Dataset

Korek Api Models

NASA 3D Models: Dawn

STI Tagging Models

Data from: SegSub: Evaluating Robustness to Knowledge Conflicts and...

Data from: Bayesian Inference for Growth Mixture Models with an Unknown...

NASA 3D Models: TDRS

Coding Index by Models Model

Number of notable machine learning models globally in 2023, by geographic...

Pricing: Image Input Pricing by Models Model

whisper-models

Black-box models for TERP interpretation

Notable AI Models

Math Index by Models Model

Civ-Models

pm-paper-datasets/Civ-Models