100+ datasets found
  1. News Articles Dataset from Indian Express

    • kaggle.com
    zip
    Updated Jun 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pulkit Komal (2020). News Articles Dataset from Indian Express [Dataset]. https://www.kaggle.com/datasets/pulkitkomal/news-article-data-set-from-indian-express/code
    Explore at:
    zip(23052019 bytes)Available download formats
    Dataset updated
    Jun 8, 2020
    Authors
    Pulkit Komal
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset contains 20k news headlines, descriptions & articles from August 11, 2019 to June 8, 2020 obtained from Indian Express.

    Acknowledgements

    This dataset was obtained from www.indianexpress.com

    Content

    article_id: has generated article id's. headline: headline of the article. desc: description of the article date: date and time of the article url: url of the article articles: full article article_type: short, mid, long values to show the length of the article. article_length: ength of the article.

  2. Medium articles dataset

    • kaggle.com
    • crawlfeeds.com
    zip
    Updated May 9, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Crawl Feeds (2021). Medium articles dataset [Dataset]. https://www.kaggle.com/crawlfeeds/medium-articles-dataset
    Explore at:
    zip(21800753 bytes)Available download formats
    Dataset updated
    May 9, 2021
    Authors
    Crawl Feeds
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Medium Articles dataset

    Medium is an American online publishing platform launched in August 2012. Crawl Feeds team extracted data from medium articles for research and analysis purposes.

    Fields

    Total fields: 15

    url, crawled_at, id, title, author, published_at, author_url, reading_time, total_claps, raw_description, source, description, tags, images, modified_at

    Get complete dataset from crawl feeds over more than 500K+ records Link

  3. Z

    CT-FAN: A Multilingual dataset for Fake News Detection

    • data.niaid.nih.gov
    • zenodo.org
    Updated Oct 23, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gautam Kishore Shahi; Julia Maria Struß; Thomas Mandl; Juliane Köhler; Michael Wiegand; Melanie Siegel (2022). CT-FAN: A Multilingual dataset for Fake News Detection [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4714516
    Explore at:
    Dataset updated
    Oct 23, 2022
    Dataset provided by
    University of Klagenfurt
    University of Applied Sciences Potsdam
    University of Duisburg-Essen
    University of Hildesheim
    Darmstadt University of Applied Sciences
    Authors
    Gautam Kishore Shahi; Julia Maria Struß; Thomas Mandl; Juliane Köhler; Michael Wiegand; Melanie Siegel
    Description

    By downloading the data, you agree with the terms & conditions mentioned below:

    Data Access: The data in the research collection may only be used for research purposes. Portions of the data are copyrighted and have commercial value as data, so you must be careful to use them only for research purposes.

    Summaries, analyses and interpretations of the linguistic properties of the information may be derived and published, provided it is impossible to reconstruct the information from these summaries. You may not try identifying the individuals whose texts are included in this dataset. You may not try to identify the original entry on the fact-checking site. You are not permitted to publish any portion of the dataset besides summary statistics or share it with anyone else.

    We grant you the right to access the collection's content as described in this agreement. You may not otherwise make unauthorised commercial use of, reproduce, prepare derivative works, distribute copies, perform, or publicly display the collection or parts of it. You are responsible for keeping and storing the data in a way that others cannot access. The data is provided free of charge.

    Citation

    Please cite our work as

    @InProceedings{clef-checkthat:2022:task3, author = {K{"o}hler, Juliane and Shahi, Gautam Kishore and Stru{\ss}, Julia Maria and Wiegand, Michael and Siegel, Melanie and Mandl, Thomas}, title = "Overview of the {CLEF}-2022 {CheckThat}! Lab Task 3 on Fake News Detection", year = {2022}, booktitle = "Working Notes of CLEF 2022---Conference and Labs of the Evaluation Forum", series = {CLEF~'2022}, address = {Bologna, Italy},}

    @article{shahi2021overview, title={Overview of the CLEF-2021 CheckThat! lab task 3 on fake news detection}, author={Shahi, Gautam Kishore and Stru{\ss}, Julia Maria and Mandl, Thomas}, journal={Working Notes of CLEF}, year={2021} }

    Problem Definition: Given the text of a news article, determine whether the main claim made in the article is true, partially true, false, or other (e.g., claims in dispute) and detect the topical domain of the article. This task will run in English and German.

    Task 3: Multi-class fake news detection of news articles (English) Sub-task A would detect fake news designed as a four-class classification problem. Given the text of a news article, determine whether the main claim made in the article is true, partially true, false, or other. The training data will be released in batches and roughly about 1264 articles with the respective label in English language. Our definitions for the categories are as follows:

    False - The main claim made in an article is untrue.

    Partially False - The main claim of an article is a mixture of true and false information. The article contains partially true and partially false information but cannot be considered 100% true. It includes all articles in categories like partially false, partially true, mostly true, miscaptioned, misleading etc., as defined by different fact-checking services.

    True - This rating indicates that the primary elements of the main claim are demonstrably true.

    Other- An article that cannot be categorised as true, false, or partially false due to a lack of evidence about its claims. This category includes articles in dispute and unproven articles.

    Cross-Lingual Task (German)

    Along with the multi-class task for the English language, we have introduced a task for low-resourced language. We will provide the data for the test in the German language. The idea of the task is to use the English data and the concept of transfer to build a classification model for the German language.

    Input Data

    The data will be provided in the format of Id, title, text, rating, the domain; the description of the columns is as follows:

    ID- Unique identifier of the news article

    Title- Title of the news article

    text- Text mentioned inside the news article

    our rating - class of the news article as false, partially false, true, other

    Output data format

    public_id- Unique identifier of the news article

    predicted_rating- predicted class

    Sample File

    public_id, predicted_rating 1, false 2, true

    IMPORTANT!

    We have used the data from 2010 to 2022, and the content of fake news is mixed up with several topics like elections, COVID-19 etc.

    Baseline: For this task, we have created a baseline system. The baseline system can be found at https://zenodo.org/record/6362498

    Related Work

    Shahi GK. AMUSED: An Annotation Framework of Multi-modal Social Media Data. arXiv preprint arXiv:2010.00502. 2020 Oct 1.https://arxiv.org/pdf/2010.00502.pdf

    G. K. Shahi and D. Nandini, “FakeCovid – a multilingual cross-domain fact check news dataset for covid-19,” in workshop Proceedings of the 14th International AAAI Conference on Web and Social Media, 2020. http://workshop-proceedings.icwsm.org/abstract?id=2020_14

    Shahi, G. K., Dirkson, A., & Majchrzak, T. A. (2021). An exploratory study of covid-19 misinformation on twitter. Online Social Networks and Media, 22, 100104. doi: 10.1016/j.osnem.2020.100104

    Shahi, G. K., Struß, J. M., & Mandl, T. (2021). Overview of the CLEF-2021 CheckThat! lab task 3 on fake news detection. Working Notes of CLEF.

    Nakov, P., Da San Martino, G., Elsayed, T., Barrón-Cedeno, A., Míguez, R., Shaar, S., ... & Mandl, T. (2021, March). The CLEF-2021 CheckThat! lab on detecting check-worthy claims, previously fact-checked claims, and fake news. In European Conference on Information Retrieval (pp. 639-649). Springer, Cham.

    Nakov, P., Da San Martino, G., Elsayed, T., Barrón-Cedeño, A., Míguez, R., Shaar, S., ... & Kartal, Y. S. (2021, September). Overview of the CLEF–2021 CheckThat! Lab on Detecting Check-Worthy Claims, Previously Fact-Checked Claims, and Fake News. In International Conference of the Cross-Language Evaluation Forum for European Languages (pp. 264-291). Springer, Cham.

  4. A study of the impact of data sharing on article citations using journal...

    • plos.figshare.com
    • dataverse.harvard.edu
    • +1more
    docx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Garret Christensen; Allan Dafoe; Edward Miguel; Don A. Moore; Andrew K. Rose (2023). A study of the impact of data sharing on article citations using journal policies as a natural experiment [Dataset]. http://doi.org/10.1371/journal.pone.0225883
    Explore at:
    docxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Garret Christensen; Allan Dafoe; Edward Miguel; Don A. Moore; Andrew K. Rose
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This study estimates the effect of data sharing on the citations of academic articles, using journal policies as a natural experiment. We begin by examining 17 high-impact journals that have adopted the requirement that data from published articles be publicly posted. We match these 17 journals to 13 journals without policy changes and find that empirical articles published just before their change in editorial policy have citation rates with no statistically significant difference from those published shortly after the shift. We then ask whether this null result stems from poor compliance with data sharing policies, and use the data sharing policy changes as instrumental variables to examine more closely two leading journals in economics and political science with relatively strong enforcement of new data policies. We find that articles that make their data available receive 97 additional citations (estimate standard error of 34). We conclude that: a) authors who share data may be rewarded eventually with additional scholarly citations, and b) data-posting policies alone do not increase the impact of articles published in a journal unless those policies are enforced.

  5. o

    NIH NCBI PubMed Central (PMC) Article Datasets - Full-Text Biomedical and...

    • registry.opendata.aws
    Updated Jul 4, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Library of Medicine (NLM) (2021). NIH NCBI PubMed Central (PMC) Article Datasets - Full-Text Biomedical and Life Sciences Journal Articles on AWS [Dataset]. https://registry.opendata.aws/ncbi-pmc/
    Explore at:
    Dataset updated
    Jul 4, 2021
    Dataset provided by
    <a href="http://nlm.nih.gov/">National Library of Medicine (NLM)</a>
    Description

    PubMed Central® (PMC) is a free full-text archive of biomedical and life sciences journal article at the U.S. National Institutes of Health's National Library of Medicine (NIH/NLM). The PubMed Central (PMC) Article Datasets include full-text articles archived in PMC and made available under license terms that allow for text mining and other types of secondary analysis and reuse. The articles are organized on AWS based on general license type:

    The PMC Open Access (OA) Subset, which includes all articles in PMC with a machine-readable Creative Commons license

    The Author Manuscript Dataset, which includes all articles collected under a funder policy in PMC and made available in machine-readable formats for text mining

    These datasets collectively span more than half of PMC’s total collection of full-text articles. PMC enables access to these datasets to expand the impact of open access and publicly-funded research; enable greater machine learning across the spectrum of scientific research; reach new audiences; and open new doors for discovery. The bucket in this registry contains individual articles in NISO Z39.96-2015 JATS XML format as well as in plain text as extracted from the XML. The bucket is updated daily with new and updated articles. Also included are file lists that include metadata for articles in each dataset.

  6. f

    The data used in this article is provided in S1 Data.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    • +1more
    Updated Mar 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peng, Jiquan; Zhao, Yan; Li, Xijian; Wei, Zeng; Wang, Cheng (2025). The data used in this article is provided in S1 Data. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0002041736
    Explore at:
    Dataset updated
    Mar 21, 2025
    Authors
    Peng, Jiquan; Zhao, Yan; Li, Xijian; Wei, Zeng; Wang, Cheng
    Description

    The data used in this article is provided in S1 Data.

  7. Google Scholar Article Listing(Data Mining)

    • kaggle.com
    zip
    Updated Apr 21, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Anas Mahmood (2023). Google Scholar Article Listing(Data Mining) [Dataset]. https://www.kaggle.com/muhammadanasmahmood/google-scholar-article-listingdata-mining
    Explore at:
    zip(155055 bytes)Available download formats
    Dataset updated
    Apr 21, 2023
    Authors
    Muhammad Anas Mahmood
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset includes google scholar articles listing on data mining, this is very helpful in many educational research works. This dataset contains 936 unique entries. including title, description, author names, article link, cited by and related articles.

  8. d

    Data release for solar-sensor angle analysis subset associated with the...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Data release for solar-sensor angle analysis subset associated with the journal article "Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States" [Dataset]. https://catalog.data.gov/dataset/data-release-for-solar-sensor-angle-analysis-subset-associated-with-the-journal-article-so
    Explore at:
    Dataset updated
    Nov 27, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Western United States, United States
    Description

    This dataset provides geospatial location data and scripts used to analyze the relationship between MODIS-derived NDVI and solar and sensor angles in a pinyon-juniper ecosystem in Grand Canyon National Park. The data are provided in support of the following publication: "Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States". The data and scripts allow users to replicate, test, or further explore results. The file GrcaScpnModisCellCenters.csv contains locations (latitude-longitude) of all the 250-m MODIS (MOD09GQ) cell centers associated with the Grand Canyon pinyon-juniper ecosystem that the Southern Colorado Plateau Network (SCPN) is monitoring through its land surface phenology and integrated upland monitoring programs. The file SolarSensorAngles.csv contains MODIS angle measurements for the pixel at the phenocam location plus a random 100 point subset of pixels within the GRCA-PJ ecosystem. The script files (folder: 'Code') consist of 1) a Google Earth Engine (GEE) script used to download MODIS data through the GEE javascript interface, and 2) a script used to calculate derived variables and to test relationships between solar and sensor angles and NDVI using the statistical software package 'R'. The file Fig_8_NdviSolarSensor.JPG shows NDVI dependence on solar and sensor geometry demonstrated for both a single pixel/year and for multiple pixels over time. (Left) MODIS NDVI versus solar-to-sensor angle for the Grand Canyon phenocam location in 2018, the year for which there is corresponding phenocam data. (Right) Modeled r-squared values by year for 100 randomly selected MODIS pixels in the SCPN-monitored Grand Canyon pinyon-juniper ecosystem. The model for forward-scatter MODIS-NDVI is log(NDVI) ~ solar-to-sensor angle. The model for back-scatter MODIS-NDVI is log(NDVI) ~ solar-to-sensor angle + sensor zenith angle. Boxplots show interquartile ranges; whiskers extend to 10th and 90th percentiles. The horizontal line marking the average median value for forward-scatter r-squared (0.835) is nearly indistinguishable from the back-scatter line (0.833). The dataset folder also includes supplemental R-project and packrat files that allow the user to apply the workflow by opening a project that will use the same package versions used in this study (eg, .folders Rproj.user, and packrat, and files .RData, and PhenocamPR.Rproj). The empty folder GEE_DataAngles is included so that the user can save the data files from the Google Earth Engine scripts to this location, where they can then be incorporated into the r-processing scripts without needing to change folder names. To successfully use the packrat information to replicate the exact processing steps that were used, the user should refer to packrat documentation available at https://cran.r-project.org/web/packages/packrat/index.html and at https://www.rdocumentation.org/packages/packrat/versions/0.5.0. Alternatively, the user may also use the descriptive documentation phenopix package documentation, and description/references provided in the associated journal article to process the data to achieve the same results using newer packages or other software programs.

  9. d

    Research Article: Breast Cancer Research : BCR

    • catalog.data.gov
    • data.virginia.gov
    Updated Sep 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institutes of Health (2025). Research Article: Breast Cancer Research : BCR [Dataset]. https://catalog.data.gov/dataset/research-article-breast-cancer-research-bcr
    Explore at:
    Dataset updated
    Sep 6, 2025
    Dataset provided by
    National Institutes of Health
    Description

    Background Disruption of the balance between apoptosis and proliferation is considered to be an important factor in the development and progression of tumours. In the present study we determined the in vivo cell kinetics along the spectrum of apparently normal epithelium, hyperplasia, preinvasive lesions and invasive carcinoma, in breast tissues affected by fibrocystic changes in which preinvasive and/or invasive lesions developed, as a model of breast carcinogenesis. Materials and methods A total of 32 areas of apparently normal epithelium and 135 ductal proliferative and neoplastic lesions were studied. More than one epithelial lesion per case were analyzed. The apoptotic index (AI) and the proliferative index (PI) were expressed as the percentage of TdT-mediated dUTP-nick end-labelling (TUNEL) and Ki-67-positive cells, respectively. The PI/AI (P/A index) was calculated for each case. Results The AIs and PIs were significantly higher in hyperplasia than in apparently normal epithelium (P = 0.04 and P = 0.0005, respectively), in atypical hyperplasia than in hyperplasia (P = 0.01 and P = 0.04, respectively) and in invasive carcinoma than in in situ carcinoma (P < 0.001 and P < 0.001, respectively). The two indices were similar in atypical hyperplasia and in in situ carcinoma. The P/A index increased significantly from normal epithelium to hyperplasia (P = 0.01) and from preinvasive lesions to invasive carcinoma (P = 0.04) whereas it was decreased (non-significantly) from hyperplasia to preinvasive lesions. A strong positive correlation between the AIs and the PIs was found (r = 0.83, P < 0.001). Conclusion These findings suggest accelerating cell turnover along the continuum of breast carcinogenesis. Atypical hyperplasias and in situ carcinomas might be kinetically similar lesions. In the transition from normal epithelium to hyperplasia and from preinvasive lesions to invasive carcinoma the net growth of epithelial cells results from a growth imbalance in favour of proliferation. In the transition from hyperplasia to preinvasive lesions there is an imbalance in favour of apoptosis.

  10. e

    List of Top Authors of Journal of Big Data sorted by article citations

    • exaly.com
    csv, json
    Updated Nov 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). List of Top Authors of Journal of Big Data sorted by article citations [Dataset]. https://exaly.com/journal/30122/journal-of-big-data/top-authors/most-cited
    Explore at:
    csv, jsonAvailable download formats
    Dataset updated
    Nov 1, 2025
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    List of Top Authors of Journal of Big Data sorted by article citations.

  11. f

    Tutorial-Articles: The Importance of Data and Code Sharing

    • scielo.figshare.com
    jpeg
    Updated Mar 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Henrique Castro Martins (2021). Tutorial-Articles: The Importance of Data and Code Sharing [Dataset]. http://doi.org/10.6084/m9.figshare.14320908.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    Mar 26, 2021
    Dataset provided by
    SciELO journals
    Authors
    Henrique Castro Martins
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT Context: this document is designed to be along with those that are in the first edition of the new section of the Journal of Contemporary Administration (RAC): the tutorial-articles section. Objective: the purpose is to present the new section and discuss relevant topics of tutorial-articles. Method: I divide the document into three main parts. First, I provide a summary of the state of the art in open data and open code at the current date that, jointly, create the context for tutorial-articles. Second, I provide some guidance to the future of the section on tutorial-articles, providing a structure and some insights that can be developed in the future. Third, I offer a short R script to show examples of open data that, I believe, can be used in the future in tutorial-articles, but also in innovative empirical studies. Conclusion: finally, I provide a short description of the first tutorial-articles accepted for publication in this current RAC’s edition.

  12. Mathematics article counts, 2014.

    • plos.figshare.com
    xls
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ryan P. Womack (2023). Mathematics article counts, 2014. [Dataset]. http://doi.org/10.1371/journal.pone.0143460.t007
    Explore at:
    xlsAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Ryan P. Womack
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Mathematics article counts, 2014.

  13. Data from: Developing an effective market for Open Access Article Processing...

    • wellcome.figshare.com
    zip
    Updated Oct 21, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Robert Kiley (2016). Developing an effective market for Open Access Article Processing Charges [Dataset]. http://doi.org/10.6084/m9.figshare.951966.v2
    Explore at:
    zipAvailable download formats
    Dataset updated
    Oct 21, 2016
    Dataset provided by
    Wellcome Trusthttps://wellcome.org/
    Authors
    Robert Kiley
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data set to support the Bjork and Solomon study "Developing an effective market for OA APC"

  14. H

    Supplementary material for Article "Reduction of Data-Value-Aware Process...

    • dataverse.harvard.edu
    • dataone.org
    Updated Nov 29, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elaheh Ordoni (2022). Supplementary material for Article "Reduction of Data-Value-Aware Process Models" [Dataset]. http://doi.org/10.7910/DVN/VG4NSK
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 29, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Elaheh Ordoni
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The supplementary material includes the original and reduced spectrum auction BPMN models, the original and the reduced spectrum auction Petri Nets, and the verification results for Article "Reduction of Data-Value-Aware Process Models based on Relevance".

  15. Microbiome differential across 38 datasets

    • kaggle.com
    zip
    Updated Jun 5, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    linze.yu (2022). Microbiome differential across 38 datasets [Dataset]. https://www.kaggle.com/datasets/linzey/microbiome-differential-across-38-datasets
    Explore at:
    zip(19485165 bytes)Available download formats
    Dataset updated
    Jun 5, 2022
    Authors
    linze.yu
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Nearing, J. T. et al. Microbiome differential abundance methods produce different results across 38 datasets. Nat Commun 13, 342, doi:10.1038/s41467-022-28034-z (2022).
    https://www.nature.com/articles/s41467-022-28034-z
    Exercises that can be used to reproduce pictures of articles.

  16. c

    [Historical Only] 311 Top Articles Referenced (from Nov. 2017 to Oct 2023)

    • s.cnmilf.com
    • data.sfgov.org
    • +1more
    Updated Sep 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    data.sfgov.org (2025). [Historical Only] 311 Top Articles Referenced (from Nov. 2017 to Oct 2023) [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/311-top-articles-referenced-current-from-nov-2017-forward
    Explore at:
    Dataset updated
    Sep 27, 2025
    Dataset provided by
    data.sfgov.org
    Description

    This report tracks the name and source/website of the articles referenced for each month shown. Titles with a department name at the end reflect information provided from the department website. The other articles are maintained by SF311 to supplement department data. Articles used by 311 in the course of creating service requests (such as graffiti handling processes) are not included. Metrics of knowledge use from September and October 2017 was inconsistent following the migration to a new system in September 2017. Additionally, the names of articles is different than article names prior to the upgrade. This dataset is updated monthly. Note - This is not up-to-date as of June 2023. We are working to fully load the full data in the near future. For historical data (Top Articles Referenced through Aug. 2017), go to: https://data.sfgov.org/City-Infrastructure/311-Top-Articles-Referenced-historical-through-Aug/hrr4-hjc6/data

  17. Data for figures in article: Mutagenicity- and Pollutant-Emission Factors of...

    • catalog.data.gov
    • datasets.ai
    Updated Nov 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. EPA Office of Research and Development (ORD) (2020). Data for figures in article: Mutagenicity- and Pollutant-Emission Factors of Pellet-Fueled Cookstoves [Dataset]. https://catalog.data.gov/dataset/data-for-figures-in-article-mutagenicity-and-pollutant-emission-factors-of-pellet-fueled-c
    Explore at:
    Dataset updated
    Nov 12, 2020
    Dataset provided by
    United States Environmental Protection Agencyhttp://www.epa.gov/
    Description

    This dataset provides supporting information for figures in the journal article entitled: Mutagenicity- and Pollutant-Emission Factors of Pellet-Fueled Gasifier Cookstoves: Comparison with Other Combustion Sources. This dataset is associated with the following publication: Champion, W., S. Warren, I. Kooter, W. Preston, T. Krantz, D. DeMarini, and J. Jetter. Mutagenicity- and Pollutant-Emission Factors of Pellet-Fueled Gasifier Cookstoves: Comparison with Other Combustion Sources. SCIENCE OF THE TOTAL ENVIRONMENT. Elsevier BV, AMSTERDAM, NETHERLANDS, 739(October 15 2020): 139488, (2020).

  18. 4

    Data underlying the research of four scenarios in the operation of water...

    • data.4tu.nl
    • figshare.com
    zip
    Updated Apr 14, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Hang Wan (2021). Data underlying the research of four scenarios in the operation of water discharge patterns of a dam [Dataset]. http://doi.org/10.4121/14398946.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 14, 2021
    Dataset provided by
    4TU.ResearchData
    Authors
    Hang Wan
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Reservoir operation rules(1) continuous flood discharge with ecological priority(2) pulse flood discharge with ecological priority
    (3) pulse flood discharge with equal weight of ecology and power generation(4) pulse flood discharge with power generation priority

  19. m

    Western-blotting Data for Lactiplantibacillus plantarum EP21

    • data.mendeley.com
    Updated Nov 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chi-Fong Lin (2024). Western-blotting Data for Lactiplantibacillus plantarum EP21 [Dataset]. http://doi.org/10.17632/3m28wjgtdh.1
    Explore at:
    Dataset updated
    Nov 20, 2024
    Authors
    Chi-Fong Lin
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Western-blotting Data for Lactiplantibacillus plantarum EP21

  20. d

    Data from: Data release associated with the journal article "Solar and...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Data release associated with the journal article "Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States" [Dataset]. https://catalog.data.gov/dataset/data-release-associated-with-the-journal-article-solar-and-sensor-geometry-not-vegetation-
    Explore at:
    Dataset updated
    Nov 19, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Western United States, United States
    Description

    This dataset supports the following publication: "Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States" (DOI:10.1016/j.rse.2020.112013). The data release allows users to replicate, test, or further explore results. The dataset consists of 4 separate items based on the analysis approach used in the original publication 1) the 'Phenocam' dataset uses images from a phenocam in a pinyon juniper ecosystem in Grand Canyon National Park to determine phenological patterns of multiple plant species. The 'Phenocam' dataset consists of scripts and tabular data developed while performing analyses and includes the final NDVI values for all areas of interest (AOIs) described in the associated publication. 2) the 'SolarSensorAnalysis' dataset uses downloaded tabular MODIS data to explore relationships between NDVI and multiple solar and sensor angles. The 'SolarSensorAnalysis' dataset consists of download and analysis scripts in Google Earth Engine and R. The source MODIS data used in the analysis are too large to include but are provided through MODIS providers and can be accessed through Google Earth Engine using the included script. A csv file includes solar and sensor angle information for the MODIS pixel closest to the phenocam as well as for a sample of 100 randomly selected MODIS pixels within the GRCA-PJ ecosystem. 3) the 'WinterPeakExtent' dataset includes final geotiffs showing the temporal frequency extent and associated vegetation physiognomic types experiencing winter NDVI peaks in the western US. 4) the "SensorComparison" dataset contains the NDVI time series at the phenocam location from 4 other satellites as well as the code used to download these data.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Pulkit Komal (2020). News Articles Dataset from Indian Express [Dataset]. https://www.kaggle.com/datasets/pulkitkomal/news-article-data-set-from-indian-express/code
Organization logo

News Articles Dataset from Indian Express

An easy to understand news article data set.

Explore at:
zip(23052019 bytes)Available download formats
Dataset updated
Jun 8, 2020
Authors
Pulkit Komal
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

Context

This dataset contains 20k news headlines, descriptions & articles from August 11, 2019 to June 8, 2020 obtained from Indian Express.

Acknowledgements

This dataset was obtained from www.indianexpress.com

Content

article_id: has generated article id's. headline: headline of the article. desc: description of the article date: date and time of the article url: url of the article articles: full article article_type: short, mid, long values to show the length of the article. article_length: ength of the article.

Search
Clear search
Close search
Google apps
Main menu