10 datasets found
  1. Data from: deepData

    • kaggle.com
    zip
    Updated Mar 2, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    David (2021). deepData [Dataset]. https://www.kaggle.com/laqwei/deepdata
    Explore at:
    zip(734389865 bytes)Available download formats
    Dataset updated
    Mar 2, 2021
    Authors
    David
    Description

    Dataset

    This dataset was created by David

    Contents

  2. DeepData Hackathon

    • kaggle.com
    Updated Oct 8, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nikhil (2025). DeepData Hackathon [Dataset]. https://www.kaggle.com/datasets/nik84810/deepdata-hackathon
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 8, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Nikhil
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Nikhil

    Released under MIT

    Contents

  3. w

    deepdata.guru - Historical whois Lookup

    • whoisdatacenter.com
    csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, deepdata.guru - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/deepdata.guru/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Nov 11, 2025
    Description

    Explore the historical Whois records related to deepdata.guru (Domain). Get insights into ownership history and changes over time.

  4. DeepData Hackathon InsightX

    • kaggle.com
    Updated Oct 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nirdesh Jain (2025). DeepData Hackathon InsightX [Dataset]. http://doi.org/10.34740/kaggle/dsv/13281488
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Nirdesh Jain
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    This dataset focuses on predicting drug–drug interaction (DDI) risks using machine learning techniques. It contains detailed information about various drugs, their pharmacological properties, molecular structures, and known interactions. The primary goal is to enable the development of predictive models that can identify potential adverse interactions between medications before they occur.

    Key Features:

    • Drug identifiers, names, and molecular descriptors
    • Known drug–drug interaction pairs and severity levels
    • Pharmacological classes and target information
    • Features suitable for supervised learning (classification/regression)

    Applications:

    • Predicting harmful drug combinations in healthcare
    • Enhancing clinical decision support systems
    • Supporting AI-driven pharmacovigilance research
  5. w

    deepdata.top - Historical whois Lookup

    • whoisdatacenter.com
    csv
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AllHeart Web Inc, deepdata.top - Historical whois Lookup [Dataset]. https://whoisdatacenter.com/domain/deepdata.top/
    Explore at:
    csvAvailable download formats
    Dataset authored and provided by
    AllHeart Web Inc
    License

    https://whoisdatacenter.com/terms-of-use/https://whoisdatacenter.com/terms-of-use/

    Time period covered
    Mar 15, 1985 - Nov 26, 2025
    Description

    Explore the historical Whois records related to deepdata.top (Domain). Get insights into ownership history and changes over time.

  6. t

    Data from: CeBed: A Benchmark for Deep Data-Driven OFDM Channel Estimation

    • service.tib.eu
    Updated Jan 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). CeBed: A Benchmark for Deep Data-Driven OFDM Channel Estimation [Dataset]. https://service.tib.eu/ldmservice/dataset/cebed--a-benchmark-for-deep-data-driven-ofdm-channel-estimation
    Explore at:
    Dataset updated
    Jan 2, 2025
    Description

    CeBed is a benchmark for deep data-driven OFDM channel estimation.

  7. k

    DeepData-META

    • dataon.kisti.re.kr
    Updated Mar 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Korea Institute of Science and Technology Information (2022). DeepData-META [Dataset]. https://dataon.kisti.re.kr/search/65e338a055f2b86f34d985fd30b8cefc
    Explore at:
    Dataset updated
    Mar 4, 2022
    Authors
    Korea Institute of Science and Technology Information
    Description

    데이터 개요 : * The data for extracting metadata from PDF domestic papers. * The data contains information in layout box extracted from each PDF paper with labels corresponding to metadata field types. * The information in each layout box are unique code, text, coordinates(x0, y0, x1, y1) of box, width of box, height of box and font size. * The file named as “train.txt” was constructed through the fully automatic inspection process. It contains a total of 5,241,746 labeled layout boxes for 295,306 papers in 503 journals. It was used as train set. * The file named as “valid.txt” was developed through the manual inspection process by several annotators. It contains a total of 155,629 labeled layout boxes for 9,895 papers in 503 journals. * The file named as “test.txt” was built through the manual inspection process. It contains a total of 159,925 labeled layout boxes for 10,119 papers in 503 journals. It was used as test set.데이터 설명 : The data for extracting metadata from PDF domestic papers.

  8. k

    DeepData-REFMETA

    • dataon.kisti.re.kr
    Updated Nov 23, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Korea Institute of Science and Technology Information (2021). DeepData-REFMETA [Dataset]. https://dataon.kisti.re.kr/search/7cb062e6f9904dabe77131271a70a795
    Explore at:
    Dataset updated
    Nov 23, 2021
    Authors
    Korea Institute of Science and Technology Information
    Description

    데이터 개요 : * The corpus for extracting metadata from multilingual journal references. * The corpus contains the tokens of reference string with IOB labels corresponding to metadata field types. * The file named as “automatic_inspection_data(train_set).txt” was constructed through the fully automatic inspection process. It contains a total of 3,680,620 labeled references, and was used as training set for training our BERT-based parsing model in our study. * The file named as “manual_insepction_data_(validation_set).txt” was developed through the manual inspection process by several annotators. It contains a total of 63,878 labeled references, and was used as validation set. * The file named as “manual_inspection_data_(test_set).txt” was built through the manual inspection process. It contains a total of 71,489 labeled references, and was used as test set. * In the files, each reference is separated by a newline character. In each first line of references, a unique identifier and language type are listed. 데이터 설명 : The corpus for extracting metadata from multilingual journal references

  9. Investigative needle core biopsies support multimodal deep-data generation...

    • data.niaid.nih.gov
    • ebi.ac.uk
    xml
    Updated May 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ryuhjin Ahn; Forest White (2025). Investigative needle core biopsies support multimodal deep-data generation in glioblastoma [Dataset]. https://data.niaid.nih.gov/resources?id=pxd046857
    Explore at:
    xmlAvailable download formats
    Dataset updated
    May 6, 2025
    Dataset provided by
    Massachusetts Institute of Technology
    Biological Engineering, MIT
    Authors
    Ryuhjin Ahn; Forest White
    Variables measured
    Proteomics
    Description

    Glioblastoma (GBM) is a primary brain cancer with an abysmal prognosis with few effective therapies. The ability to investigate the tumor microenvironment before and during treatment would greatly enhance both our understanding of disease response and progression, as well as the delivery and impact of therapeutics. Stereotactic biopsies are a routine surgical procedure performed primarily for diagnostic histopathologic purposes. The adaptation of stereotactic biopsy tissue for complex and integrated investigative multi-modal molecular analyses (‘Multi-omics”) in the context of GBM regional heterogeneity is not routinely performed, Most of the tissue is consumed with standard of care clinical testing and the amount and quality of remaining cells, particularly in the context of recurrent GBMs that have failed previous treatments, has not previously been shown to be amenable to complex analyses. Here we performed highly resolved multi-modal analysis methods including single cell RNA sequencing, spatial-transcriptomics, metabolomics, proteomics, phosphoproteomics, T-cell clonal analysis, and immunopeptidomics on needle biopsy cores obtained from a single patient during the same procedure. In a second patient, we analyzed multi-regional core biopsies to decipher spatially associated tissue and genomic variance. Finally in a separate cohort of patients we investigated the utility of stereotactic biopsies as a method for generating patient derived xenograft models. Dataset integration across modalities showed good correspondence between spatial modalities and revealed tumor and immune cell associated metabolic profiles and cell signalling pathways. In conclusion, stereotactic needle biopsy cores are of sufficient quality for the purposes of investigative biopsy and can generate multi-omics data, providing data rich insight into a patient’s disease to interrogate the tumor immune microenvironment.

  10. Github Indian users deep data

    • kaggle.com
    zip
    Updated Oct 22, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Archit Tyagi (2024). Github Indian users deep data [Dataset]. https://www.kaggle.com/datasets/architty108/github-indian-users-deep-data/code
    Explore at:
    zip(1694479 bytes)Available download formats
    Dataset updated
    Oct 22, 2024
    Authors
    Archit Tyagi
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Area covered
    India
    Description

    This dataset provides a rich snapshot of GitHub users from India, capturing various aspects of their public profiles. It's a valuable resource for analyzing trends in coding activity, repository management, and user engagement within the Indian developer community. Whether you're interested in exploring how developers grow their followers, examining language preferences, or identifying patterns in contributions and achievements, this dataset offers multiple points of analysis.

    Key Features: - Username: GitHub usernames of the individuals. - Gender Pronoun: Preferred gender pronouns (if available). - Followings: Number of people each user follows. - Joining Year: The year they joined GitHub. - Contributions: Number of contributions made in the last year. - Achievements: Number of GitHub achievements unlocked by the user. - Stars: Total number of stars on their repositories. - Repositories: Number of repositories created. - Followers: Number of followers each user has. - Location: User location details, primarily from India. - Languages: Primary programming language used by the individual. - Social Links: Links to their other social platforms (LinkedIn, personal websites, etc.). - Sorting Type: Categorized based on followers, repositories, or recent joining.

    This dataset can be used for: - Profiling the Indian developer community. - Tracking open-source contributions and achievements. - Analyzing programming language preferences and repository management. - Exploring the relationship between social followings and coding contributions.

    Perfect for data science, social network analysis, and open-source research.

  11. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
David (2021). deepData [Dataset]. https://www.kaggle.com/laqwei/deepdata
Organization logo

Data from: deepData

Related Article
Explore at:
192 scholarly articles cite this dataset (View in Google Scholar)
zip(734389865 bytes)Available download formats
Dataset updated
Mar 2, 2021
Authors
David
Description

Dataset

This dataset was created by David

Contents

Search
Clear search
Close search
Google apps
Main menu