100+ datasets found
  1. t

    Why AI Can't Crack Your Database - Data Analysis

    • tomtunguz.com
    Updated Aug 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tomasz Tunguz (2025). Why AI Can't Crack Your Database - Data Analysis [Dataset]. https://tomtunguz.com/spider-2-benchmark-trends/
    Explore at:
    Dataset updated
    Aug 13, 2025
    Dataset provided by
    Theory Ventures
    Authors
    Tomasz Tunguz
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Explore why AI excels at complex math but struggles with SQL queries, with benchmark data showing a 60% accuracy ceiling in database operations across leading models.

  2. t

    The MIT-BIH Long Term Database - Dataset - LDM

    • service.tib.eu
    • resodate.org
    Updated Dec 3, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). The MIT-BIH Long Term Database - Dataset - LDM [Dataset]. https://service.tib.eu/ldmservice/dataset/the-mit-bih-long-term-database
    Explore at:
    Dataset updated
    Dec 3, 2024
    Description

    Five open ECG databases from PhysioNet are involved in this study namely the MIT-BIH arrhythmia database,St-Petersburg Institute of Cardiological Technics 12-lead Arrhythmia Database,The MIT-BIH Normal Sinus Rhythm Database,The MIT-BIH Long Term Database and European ST-T Database.

  3. T-100 Domestic Market and Segment Data

    • catalog.data.gov
    • geodata.bts.gov
    • +1more
    Updated Sep 5, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bureau of Transportation Statistics (BTS) (Point of Contact) (2025). T-100 Domestic Market and Segment Data [Dataset]. https://catalog.data.gov/dataset/t-100-domestic-market-and-segment-data1
    Explore at:
    Dataset updated
    Sep 5, 2025
    Dataset provided by
    Bureau of Transportation Statisticshttp://www.rita.dot.gov/bts
    Description

    The T-100 Domestic Market and Segment Data dataset was downloaded on April 08, 2025 from the Bureau of Transportation Statistics (BTS) and is part of the U.S. Department of Transportation (USDOT)/Bureau of Transportation Statistics (BTS) National Transportation Atlas Database (NTAD). It shows 2024 statistics for all domestic airports operated by US carriers, and all information are totals for the year across all four (4) service classes (F - Scheduled Passenger/ Cargo Service, G - Scheduled All Cargo Service, L - Non-Scheduled Civilian Passenger/ Cargo Service, and P - Non-Scheduled Civilian All Cargo Service). This dataset is a combination of both T-100 Market and T-100 Segments datasets. The T-100 Market includes enplanement data, and T-100 Segment data includes passengers, arrivals, departures, freight, and mail. Data is by origin airport. Along with yearly aggregate totals for these variables, this dataset also provides more granular information for the passenger and freight variable by service class and by scheduled vs non-scheduled statistics where applicable. A data dictionary, or other source of attribute information, is accessible at https://doi.org/10.21949/1529081

  4. Foreign Trade Zones Manufacturing (T/IM) Database

    • catalog.data.gov
    Updated Sep 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    International Trade Administration (2025). Foreign Trade Zones Manufacturing (T/IM) Database [Dataset]. https://catalog.data.gov/dataset/foreign-trade-zones-manufacturing-t-im-database
    Explore at:
    Dataset updated
    Sep 30, 2025
    Dataset provided by
    International Trade Administrationhttp://trade.gov/
    Description

    This database allows the public to browse and search recent FTZ Board manufacturing approvals.

  5. T-DNA database

    • figshare.com
    txt
    Updated Jun 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Galina Khafizova; Tatiana Matveeva (2023). T-DNA database [Dataset]. http://doi.org/10.6084/m9.figshare.5754120.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Galina Khafizova; Tatiana Matveeva
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This file is a database, containing all Agrobacterium's genes, found in different types of T-DNA.The head of each sequences contains names of the gen, strain, species, and an Acs.number.

  6. f

    Data Sheet 1_A large-scale database of T-cell receptor beta sequences and...

    • datasetcatalog.nlm.nih.gov
    • frontiersin.figshare.com
    Updated Feb 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Semprini, Simona; Svejnoha, Emily; Pesesky, Mitchell W.; Delmonte, Ottavia M.; Dines, Jennifer N.; Goldman, Jason D.; Martinelli, Giovanni; Carlson, Jonathan M.; Howie, Bryan; Carreño-Tarragona, Gonzalo; Heath, James R.; Boland, Katie; Cerchione, Claudio; Nicolini, Fabio; Craft, Tracy; Snyder, Thomas M.; Klinger, Mark; Kaplan, Ian M.; Martinez-Lopez, Joaquin; Vignali, Marissa; Sambri, Vittorio; Dobbs, Kerry; Robins, Harlan S.; Nolan, Sean; Gittelman, Rachel M.; Gooley, Christopher J.; Notarangelo, Luigi D.; Barrio, Santiago; Mazza, Massimiliano (2025). Data Sheet 1_A large-scale database of T-cell receptor beta sequences and binding associations from natural and synthetic exposure to SARS-CoV-2.pdf [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001284335
    Explore at:
    Dataset updated
    Feb 17, 2025
    Authors
    Semprini, Simona; Svejnoha, Emily; Pesesky, Mitchell W.; Delmonte, Ottavia M.; Dines, Jennifer N.; Goldman, Jason D.; Martinelli, Giovanni; Carlson, Jonathan M.; Howie, Bryan; Carreño-Tarragona, Gonzalo; Heath, James R.; Boland, Katie; Cerchione, Claudio; Nicolini, Fabio; Craft, Tracy; Snyder, Thomas M.; Klinger, Mark; Kaplan, Ian M.; Martinez-Lopez, Joaquin; Vignali, Marissa; Sambri, Vittorio; Dobbs, Kerry; Robins, Harlan S.; Nolan, Sean; Gittelman, Rachel M.; Gooley, Christopher J.; Notarangelo, Luigi D.; Barrio, Santiago; Mazza, Massimiliano
    Description

    We describe the establishment and current content of the ImmuneCODE™ database, which includes hundreds of millions of T-cell Receptor (TCR) sequences from over 1,400 subjects exposed to or infected with the SARS-CoV-2 virus, as well as over 160,000 high-confidence SARS-CoV-2-associated TCRs. This database is made freely available, and the data contained in it can be used to assist with global efforts to understand the immune response to the SARS-CoV-2 virus and develop new interventions.

  7. p

    QT Database

    • physionet.org
    Updated Nov 16, 1999
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roger Mark; George Moody (1999). QT Database [Dataset]. http://doi.org/10.13026/C24K53
    Explore at:
    Dataset updated
    Nov 16, 1999
    Authors
    Roger Mark; George Moody
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Description

    Over 100 fifteen-minute two-lead ECG recordings (many excerpted from other databases), with onset, peak, and end markers for P, QRS, T, and (where present) U waves of from 30 to 50 selected beats in each recording.

  8. SpaceEye-T VVHR EO Open Data

    • registry.opendata.aws
    Updated Sep 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SI Imaging Services (2025). SpaceEye-T VVHR EO Open Data [Dataset]. https://registry.opendata.aws/st-open-data/
    Explore at:
    Dataset updated
    Sep 26, 2025
    Dataset provided by
    SI Imaging Services Co., Ltd.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    SpaceEye-T satellite collects the highest resolution optical imagery among the commercial satellites, 25 cm resolution. The Open Data features various satellite images around the world for end users to experience the power of VVHR optical data.

  9. d

    SYFPEITHI: A Database for MHC Ligands and Peptide Motifs

    • dknet.org
    • neuinfo.org
    • +2more
    Updated Sep 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). SYFPEITHI: A Database for MHC Ligands and Peptide Motifs [Dataset]. http://identifiers.org/RRID:SCR_013182
    Explore at:
    Dataset updated
    Sep 5, 2024
    Description

    SYFPEITHI is a database comprising more than 7000 peptide sequences known to bind class I and class II MHC molecules. The entries are compiled from published reports only. It contains a collection of MHC class I and class II ligands and peptide motifs of humans and other species, such as apes, cattle, chicken, and mouse, for example, and is continuously updated. Searches for MHC alleles, MHC motifs, natural ligands, T-cell epitopes, source proteins/organisms and references are possible. Hyperlinks to the EMBL and PubMed databases are included. In addition, ligand predictions are available for a number of MHC allelic products. The database is based on previous publications on T-cell epitopes and MHC ligands. It contains information on: -Peptide sequences -anchor positions -MHC specificity -source proteins, source organisms -publication references Since the number of motifs continuously increases, it was necessary to set up a database which facilitates the search for peptides and allows the prediction of T-cell epitopes. The prediction is based on published motifs (pool sequencing, natural ligands) and takes into consideration the amino acids in the anchor and auxiliary anchor positions, as well as other frequent amino acids. The score is calculated according to the following rules: The amino acids of a certain peptide are given a specific value depending on whether they are anchor, auxiliary anchor or preferred residue. Ideal anchors will be given 10 points, unusual anchors 6-8 points, auxiliary anchors 4-6 and preferred residues 1-4 points. Amino acids that are regarded as having a negative effect on the binding ability are given values between -1 and -3. Sponsors: SYFPEITHI is supported by DFG-Sonderforschungsbereich 685 and theEuropean Union: EU BIOMED CT95-1627, BIOTECH CT95-0263, and EU QLQ-CT-1999-00713.

  10. Assessments and Taxation Database, MD Property View 2004, Harford County

    • search.dataone.org
    Updated Oct 14, 2013
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne (2013). Assessments and Taxation Database, MD Property View 2004, Harford County [Dataset]. https://search.dataone.org/view/knb-lter-bes.354.570
    Explore at:
    Dataset updated
    Oct 14, 2013
    Dataset provided by
    Long Term Ecological Research Networkhttp://www.lternet.edu/
    Authors
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne
    Time period covered
    Jan 1, 2004 - Nov 17, 2011
    Area covered
    Description

    Assessments Taxation (A and T) Database from MD Property View 2004 for Harford County. The A and T Database contains parcel data from the State Department of Assessments and Taxation; it incorporates parcel ownership and address information, parcel valuation information and basic information about the land and structure(s) associated with a given parcel. These data form the basis for the 2004 Database, which also includes selected Computer Assisted Mass Appraisal (CAMA) characteristics, text descriptions to make parcel code field data more readily accessible and logical True/False fields which identify parcels with certain characteristics. Documentation for A and T, including a thorough definition for all attributes is enclosed. Complete Property View documentation can be found at http://www.mdp.state.md.us/data/index.htm under the "Technical Background" tab. It should be noted that the A and T Database consists of points and not parcel boundaries. For those areas where parcel polygon data exists the A and T Database can be joined using the ACCTID or a concatenation of the BLOCK and LOT fields, whichever is appropriate. (Spaces may have to be excluded when concatenating the BLOCK and LOT fields). A cursory review of the 2004 version of the A and T Database indicates that it has more accurate data when compared with the 2003 version, particularly with respect to dwelling types. However, for a given record it is not uncommon for numerous fields to be missing attributes. Based on previous version of the A and T Database it is also not unlikely that some of the information is inaccurate. This layer was edited to remove points that did not have a valid location because they failed to geocode. There were 690 such points. A listing of the deleted points is in the table with the suffix "DeletedRecords." This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful. This is part of a collection of 221 Baltimore Ecosystem Study metadata records that point to a geodatabase. The geodatabase is available online and is considerably large. Upon request, and under certain arrangements, it can be shipped on media, such as a usb hard drive. The geodatabase is roughly 51.4 Gb in size, consisting of 4,914 files in 160 folders. Although this metadata record and the others like it are not rich with attributes, it is nonetheless made available because the data that it represents could be indeed useful.

  11. n

    Data from: MHC-Peptide Interaction Database

    • neuinfo.org
    • dknet.org
    • +1more
    Updated Jan 29, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). MHC-Peptide Interaction Database [Dataset]. http://identifiers.org/RRID:SCR_007784
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    The MHC-Peptide Interaction Database version T (MPID-T) is a new generation database for sequence-structure-function information on T cell receptor/peptide/MHC interactions. It contains all structures of TcR/pMHC and pMHC complexes, with emphasis on the structural characterization of these complexes. MPID-T will facilitate the development of algorithms to predict whether a peptide sequence will bind to a specific MHC allele. The database has been populated with the data from the Protein Data Bank(PDB). The data from the PDB is manually verified and classified, after which each structure is analysed for atomic interactions relevant to MHC-Peptide complex.

  12. e

    GIS Shapefile, Assessments and Taxation Database, MD Property View 2004,...

    • portal.edirepository.org
    zip
    Updated Nov 15, 2004
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jarlath O'Neil-Dunne; Morgan Grove (2004). GIS Shapefile, Assessments and Taxation Database, MD Property View 2004, Anne Arundel County [Dataset]. http://doi.org/10.6073/pasta/90423ac99514c6adfc4b131a8221c5a2
    Explore at:
    zip(28561 kilobyte)Available download formats
    Dataset updated
    Nov 15, 2004
    Dataset provided by
    EDI
    Authors
    Jarlath O'Neil-Dunne; Morgan Grove
    Time period covered
    Jan 1, 2004 - Jan 1, 2005
    Area covered
    Description

    AT_2004_ANNE

       File Geodatabase Feature Class
    
    
       Thumbnail Not Available
    
       Tags
    
       Socio-economic resources, Information, Social Institutions, Hierarchy, Territory, BES, Parcel, Property, Property View, A&T, Database, Assessors, Taxation
    
    
    
    
       Summary
    
    
       Serves as a basis for performing various analyses based on parcel data.
    
    
       Description
    
    
       Assessments & Taxation (A&T) Database from MD Property View 2004 for Anne Arundel County. The A&T Database contains parcel data from the State Department of Assessments and Taxation; it incorporates parcel ownership and address information, parcel valuation information and basic information about the land and structure(s) associated with a given parcel. These data form the basis for the 2004 Database, which also includes selected Computer Assisted Mass Appraisal (CAMA) characteristics, text descriptions to make parcel code field data more readily accessible and logical True/False fields which identify parcels with certain characteristics. Documentation for A&T, including a thorough definition for all attributes is enclosed. Complete Property View documentation can be found at http://www.mdp.state.md.us/data/index.htm under the "Technical Background" tab.
    
    
       It should be noted that the A&T Database consists of points and not parcel boundaries. For those areas where parcel polygon data exists the A&T Database can be joined using the ACCTID or a concatenation of the BLOCK and LOT fields, whichever is appropriate. (Spaces may have to be excluded when concatenating the BLOCK and LOT fields).
    
    
       A cursory review of the 2004 version of the A&T Database indicates that it has more accurate data when compared with the 2003 version, particularly with respect to dwelling types. However, for a given record it is not uncommon for numerous fields to be missing attributes. Based on previous version of the A&T Database it is also not unlikely that some of the information is inaccurate. This layer was edited to remove points that did not have a valid location because they failed to geocode. There were 897 such points. A listing of the deleted points is in the table with the suffix "DeletedRecords."
    
    
       Credits
    
    
       Maryland Department of Planning
    
    
       Use limitations
    
    
       BES use only.
    
    
       Extent
    
    
    
       West -76.838738  East -76.395283 
    
       North 39.238726  South 38.708588 
    
    
    
    
       Scale Range
    
       There is no scale range for this item.
    
  13. Data from: Neoantigen-1a-Fusion-Database-Generation

    • zenodo.org
    application/gzip, bin
    Updated Jun 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Katherine Do; Katherine Do (2025). Neoantigen-1a-Fusion-Database-Generation [Dataset]. http://doi.org/10.5281/zenodo.14365542
    Explore at:
    bin, application/gzipAvailable download formats
    Dataset updated
    Jun 30, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Katherine Do; Katherine Do
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A neoantigen is a novel peptide (protein fragment) that is produced by cancer cells due to mutations, including gene fusions, that alter the DNA sequence in a way that generates unique proteins not found in normal cells. Because these mutated proteins are unique to the tumor, they are recognized as "foreign" by the immune system. Neoantigens are valuable in immunotherapy because they can serve as specific targets for the immune system, allowing treatments to selectively attack cancer cells while sparing normal tissue. By stimulating an immune response specifically against these neoantigens, therapies like cancer vaccines or T-cell-based treatments can be developed to enhance the body’s natural defense mechanisms, making neoantigens a promising avenue for personalized cancer treatment.

    Creating a fusion database is essential in cancer genomics and personalized medicine, as it enables the identification of crucial biomarkers, enhances diagnostic accuracy, and supports therapeutic development. Gene fusions, where parts of two previously separate genes merge, can produce abnormal proteins that drive cancer. Cataloging these fusion events in a database helps researchers identify specific biomarkers linked to cancer types and design more targeted treatments. Additionally, fusion events may lead to unique peptide sequences, known as neoantigens, which are found only in cancer cells. These neoantigens can be targeted by the immune system, making fusion databases valuable in designing personalized immunotherapies like cancer vaccines or T-cell therapies. Some gene fusions also create oncogenic proteins that promote tumor growth, such as the BCR-ABL fusion in chronic myeloid leukemia. Including such information in a database aids in identifying potential therapeutic targets and predicting treatment efficacy. On the diagnostic side, known gene fusions serve as reliable markers, helping clinicians better classify cancer types and choose the most effective treatments. Finally, fusion databases provide a critical reference for researchers studying fusion mechanisms, their impact on disease progression, and their prevalence across cancers, ultimately fueling the discovery of novel treatments and therapies.

  14. GIS Shapefile - GIS Shapefile, Assessments and Taxation Database, MD...

    • search.dataone.org
    • portal.edirepository.org
    Updated Apr 5, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne; Morgan Grove (2019). GIS Shapefile - GIS Shapefile, Assessments and Taxation Database, MD Property View 2004, Carroll County [Dataset]. https://search.dataone.org/view/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fmetadata%2Feml%2Fknb-lter-bes%2F353%2F610
    Explore at:
    Dataset updated
    Apr 5, 2019
    Dataset provided by
    Long Term Ecological Research Networkhttp://www.lternet.edu/
    Authors
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne; Morgan Grove
    Time period covered
    Jan 1, 2004 - Jan 1, 2005
    Area covered
    Description

    AT_2004_CARR File Geodatabase Feature Class Thumbnail Not Available Tags Socio-economic resources, Information, Social Institutions, Hierarchy, Territory, BES, Parcel, Property, Property View, A&T, Database, Assessors, Taxation Summary Serves as a basis for performing various analyses based on parcel data. Description Assessments & Taxation (A&T) Database from MD Property View 2004 for Carroll County. The A&T Database contains parcel data from the State Department of Assessments and Taxation; it incorporates parcel ownership and address information, parcel valuation information and basic information about the land and structure(s) associated with a given parcel. These data form the basis for the 2004 Database, which also includes selected Computer Assisted Mass Appraisal (CAMA) characteristics, text descriptions to make parcel code field data more readily accessible and logical True/False fields which identify parcels with certain characteristics. Documentation for A&T, including a thorough definition for all attributes is enclosed. Complete Property View documentation can be found at http://www.mdp.state.md.us/data/index.htm under the "Technical Background" tab. It should be noted that the A&T Database consists of points and not parcel boundaries. For those areas where parcel polygon data exists the A&T Database can be joined using the ACCTID or a concatenation of the BLOCK and LOT fields, whichever is appropriate. (Spaces may have to be excluded when concatenating the BLOCK and LOT fields). A cursory review of the 2004 version of the A&T Database indicates that it has more accurate data when compared with the 2003 version, particularly with respect to dwelling types. However, for a given record it is not uncommon for numerous fields to be missing attributes. Based on previous version of the A&T Database it is also not unlikely that some of the information is inaccurate. This layer was edited to remove points that did not have a valid location because they failed to geocode. There were 848 such points. A listing of the deleted points is in the table with the suffix "DeletedRecords." Credits Maryland Department of Planning Use limitations BES use only. Extent West -77.306843 East -76.779275 North 39.727017 South 39.342858 Scale Range There is no scale range for this item.

  15. Geospatial data for the Vegetation Mapping Inventory Project of Booker T....

    • catalog.data.gov
    • data.amerigeoss.org
    Updated Oct 23, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Park Service (2025). Geospatial data for the Vegetation Mapping Inventory Project of Booker T. Washington National Monument [Dataset]. https://catalog.data.gov/dataset/geospatial-data-for-the-vegetation-mapping-inventory-project-of-booker-t-washington-nation
    Explore at:
    Dataset updated
    Oct 23, 2025
    Dataset provided by
    National Park Servicehttp://www.nps.gov/
    Description

    The files linked to this reference are the geospatial data created as part of the completion of the baseline vegetation inventory project for the NPS park unit. Current format is ArcGIS file geodatabase but older formats may exist as shapefiles. A vegetation map of Booker T. Washington National Monument was created following the USGS-NPS Vegetation Mapping Program protocols. These vegetation associations were crosswalked to the Natural Communities of Virginia and to the USNVC in order to provide a regional and global context for the park’s vegetation. A field key to the map classes and detailed descriptions for each map class were developed to assist with field recognition and classification. Additional products associated with this project include: leaf-on and leaf-off orthophoto mosaics, database of vegetation plot data, digital photos of vegetation associations, and spatial data files for the vegetation map and plot sample points with associated Federal Geographic Data Committee (FGDC)-compliant metadata.

  16. f

    Desktop analysis and examination of six key food composition databases...

    • plos.figshare.com
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Annabel K. Clancy; Kaitlyn Woods; Anne McMahon; Yasmine Probst (2023). Desktop analysis and examination of six key food composition databases format. [Dataset]. http://doi.org/10.1371/journal.pone.0142137.t003
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Annabel K. Clancy; Kaitlyn Woods; Anne McMahon; Yasmine Probst
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    a Food Standards Australia and New Zealand,b United States Department of Agriculture,c Food Standards Agency,d Separate databases for flavonoids, carotenoids, proanthocyanidins and isoflavones,e Eurofir EBASIS contains bioactive data for UK and Europe,f National Health Survey,ghttps://www.xyris.com.au/foodworks/fw_pro.html,hhttp://www.nutribase.com/highend.html,ihttp://www.foodresearch.ca/wp-content/uploads/2013/06/candat-features-1.pdf,j Tinuviel Software,i Downlees Systems,k Forestfield Software,l Kelicomp,mhttp://www.tinuvielsoftware.com/faqs.htm,nhttp://www.dietsoftware.com/canada.html,o Text file: a file that only contains text,p A file containing tables of information stored in columns and separated by tabs (can be exported into almost any spreadsheet program),q Microsoft Excel spreadsheet,r Microsoft Access Database file: is a database file with automated functions and queries,s American Standard Code for Information Interchange (a standard file type that can be used by many programs),t Database File Format (this file type can be opened with Microsoft Excel and Access),u information to create Excel or PDF available,v Composition of Foods, Australia,w International Network of Food Data System,x Users guide states food name is most descriptive & recognisable of food referencedyhttp://www.foodstandards.gov.au/science/monitoringnutrients/nutrientables/nuttab/Pages/NUTTAB-2010-electronic-database-files.aspx,zhttp://www.foodstandards.gov.au/science/monitoringnutrients/ausnut/ausnutdatafiles/Pages/default.aspx,aahttp://ndb.nal.usda.gov/ndb/search/list,bbhttp://tna.europarchive.org/20110116113217/http://www.food.gov.uk/science/dietarysurveys/dietsurveys/,cchttp://webprod3.hc-sc.gc.ca/cnf-fce/index-eng.jspDesktop analysis and examination of six key food composition databases format.

  17. E

    VERBA Polytechnic and Plurilingual Terminological Database - T-AI Printing...

    • catalog.elra.info
    • live.european-language-grid.eu
    Updated Jun 27, 2016
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency) (2016). VERBA Polytechnic and Plurilingual Terminological Database - T-AI Printing Industry [Dataset]. https://catalog.elra.info/en-us/repository/browse/ELRA-T0316/
    Explore at:
    Dataset updated
    Jun 27, 2016
    Dataset provided by
    ELRA (European Language Resources Association) and its operational body ELDA (Evaluations and Language resources Distribution Agency)
    ELRA (European Language Resources Association)
    License

    https://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdf

    Description
    • Entries for English-Spanish: Scientific research & mathematical sciences (906 entries), Geosciences (10,215), Computer science, electronics & telecommunications (70,580), Industry (47,578), Transport & Maintenance (12,291), Economy (145,572), Biological sciences (38,989), Communication & media (8,143), Chemical & physical sciences (27,467). * Entries for English-French-German-Spanish: Environment (36,658), Health (66,727), Agriculture & food (25,975), Construction & public works (8,429), Law & policy (56,578), Sports & Leisure (17,312) * Two specialized lexicons: Spanish-English and English-French-German without domain codes: electronics, telematics, law, taxes, customs, etc. (550,000 entries). * Two general lexicons: Spanish-English-French-German and Spanish-English-French-German-Portuguese-Italian (83,000 entries).This terminological database contains, for each domain, a sub-domain indication is given (from 2 sub-domains for Scientific research to 39 for Sports & leisure). Each entry consists of a definition, phraseological unit, abbreviation, usage information, grammatical labels. Format: ASCII
  18. GIS Shapefile - GIS Shapefile, Assessments and Taxation Database, MD...

    • search.dataone.org
    • portal.edirepository.org
    Updated Apr 3, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne; Morgan Grove (2019). GIS Shapefile - GIS Shapefile, Assessments and Taxation Database, MD Property View 2004, Harford County [Dataset]. https://search.dataone.org/view/https%3A%2F%2Fpasta.lternet.edu%2Fpackage%2Fmetadata%2Feml%2Fknb-lter-bes%2F354%2F610
    Explore at:
    Dataset updated
    Apr 3, 2019
    Dataset provided by
    Long Term Ecological Research Networkhttp://www.lternet.edu/
    Authors
    Cary Institute Of Ecosystem Studies; Jarlath O'Neil-Dunne; Morgan Grove
    Time period covered
    Jan 1, 2004 - Jan 1, 2005
    Area covered
    Description

    AT_2004_HARF File Geodatabase Feature Class Thumbnail Not Available Tags Socio-economic resources, Information, Social Institutions, Hierarchy, Territory, BES, Parcel, Property, Property View, A&T, Database, Assessors, Taxation Summary Serves as a basis for performing various analyses based on parcel data. Description Assessments & Taxation (A&T) Database from MD Property View 2004 for Harford County. The A&T Database contains parcel data from the State Department of Assessments and Taxation; it incorporates parcel ownership and address information, parcel valuation information and basic information about the land and structure(s) associated with a given parcel. These data form the basis for the 2004 Database, which also includes selected Computer Assisted Mass Appraisal (CAMA) characteristics, text descriptions to make parcel code field data more readily accessible and logical True/False fields which identify parcels with certain characteristics. Documentation for A&T, including a thorough definition for all attributes is enclosed. Complete Property View documentation can be found at http://www.mdp.state.md.us/data/index.htm under the "Technical Background" tab. It should be noted that the A&T Database consists of points and not parcel boundaries. For those areas where parcel polygon data exists the A&T Database can be joined using the ACCTID or a concatenation of the BLOCK and LOT fields, whichever is appropriate. (Spaces may have to be excluded when concatenating the BLOCK and LOT fields).

  19. r

    TOPOFIT Database

    • rrid.site
    • dknet.org
    • +2more
    Updated Jan 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2022). TOPOFIT Database [Dataset]. http://identifiers.org/RRID:SCR_006936
    Explore at:
    Dataset updated
    Jan 29, 2022
    Description

    THIS RESOURCE IS NO LONGER IN SERVICE, documented on August 26, 2016. The T-DB contains millions of structural alignments between all proteins in the PDB as of July 2005 found by the TOPOFIT method. Structural neighbors to the query protein structure can be retrieved by the PDB code and chain id. Pairwise or multiple structural alignments can be visualized in 3D with the FRIEND software. In the TOPOFIT method, similarity of protein structures is analyzed using three-dimensional Delaunay triangulation patterns derived from backbone representation. It has been found that structurally related proteins have a common spatial invariant part, a set of tetrahedrons, mathematically described as a common spatial sub-graph volume of the three-dimensional contact graph derived from Delaunay tessellation (DT). Based on this property of protein structures we present a novel common volume superimposition (TOPOFIT) method to produce structural alignments of proteins. The superimposition of the DT patterns allows one to uniquely identify a common number of equivalent residues in the structural alignment, in other words, TOPOFIT identifies a feature point on the RMSD/Ne curve, a topomax point, until which two structures correspond to each other including backbone and inter-residue contacts, while the growing number of mismatches between the DT patterns occurs at larger RMSD (Ne) after topomax point. The topomax point is present in all alignments from different protein structural classes; therefore, the TOPOFIT method identifies common, invariant structural parts between proteins. The TOPOFIT method adds new opportunities for the comparative analysis of protein structures and for more detailed studies on understanding the molecular principles of tertiary structure organization and functionality. It helps to detect conformational changes, topological differences in variable parts, which are particularly important for studies of variations in active/binding sites and protein classification.

  20. d

    Gulf of Maine - Water Salinity, Temperature and Sigma t (density) data from...

    • catalog.data.gov
    • gimi9.com
    • +2more
    Updated Oct 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (Point of Contact, Custodian) (2025). Gulf of Maine - Water Salinity, Temperature and Sigma t (density) data from 1981 to 2005 [Dataset]. https://catalog.data.gov/dataset/gulf-of-maine-water-salinity-temperature-and-sigma-t-density-data-from-1981-to-20051
    Explore at:
    Dataset updated
    Oct 30, 2025
    Dataset provided by
    (Point of Contact, Custodian)
    Area covered
    Gulf of Maine
    Description

    This table contains water salinity, temperature and sigma t (density) data from 1981 to 2005 binned at 10 meter depth intervals (from 300 meters up to 0 meters) for the Gulf of Maine. It was acquired from the Canadian Fisheries and Oceans Hydrographic database, which covers an area defined by 35 deg - 80 deg N and 42 deg - 100 deg W, and contains over 500,000 temperature-salinity profiles for the Northwest Atlantic from 1910 to the present. The data comes from a variety of sources including hydrographic bottles, CTD casts (either up or down casts), spatially and temporally averaged Batfish tows, and expendable, digital or mechanical bathythermographs. Near real-time data in the form of IGOSS Bathy or Tesac messages are also included. Updates are made monthly.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Tomasz Tunguz (2025). Why AI Can't Crack Your Database - Data Analysis [Dataset]. https://tomtunguz.com/spider-2-benchmark-trends/

Why AI Can't Crack Your Database - Data Analysis

Explore at:
Dataset updated
Aug 13, 2025
Dataset provided by
Theory Ventures
Authors
Tomasz Tunguz
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Explore why AI excels at complex math but struggles with SQL queries, with benchmark data showing a 60% accuracy ceiling in database operations across leading models.

Search
Clear search
Close search
Google apps
Main menu