11 datasets found
  1. BBC News Dataset – February 2023 Edition

    • crawlfeeds.com
    csv, zip
    Updated Jun 14, 2025
    Cite
    Crawl Feeds (2025). BBC News Dataset – February 2023 Edition [Dataset]. https://crawlfeeds.com/datasets/bbc-news-dataset-feb-2023
    Available download formats: zip, csv
    Dataset updated
    Jun 14, 2025
    Dataset authored and provided by
    Crawl Feeds
    License

    https://crawlfeeds.com/privacy_policy

    Description

    Get access to a comprehensive and structured dataset of BBC News articles, freshly crawled and compiled in February 2023. This collection includes 1 million records from one of the world’s most trusted news organizations — perfect for training NLP models, sentiment analysis, and trend detection across global topics.

    💾 Format: CSV (available in ZIP archive)

    📢 Status: Published and available for immediate access

    Use Cases

    • Train language models to summarize or categorize news

    • Detect media bias and compare narrative framing

    • Conduct research in journalism, politics, and public sentiment

    • Enrich news aggregation platforms with clean metadata

    • Analyze content distribution across categories (e.g. health, politics, tech)

    This dataset ensures reliable and high-quality information sourced from a globally respected outlet. The format is optimized for quick ingestion into your pipelines — with clean text, timestamps, image links, and more.

    Need a filtered dataset or want this refreshed for a later date? We offer on-demand news scraping as well.


  2. BBC Datasets

    • brightdata.com
    .json, .csv, .xlsx
    Updated Nov 12, 2023
    Cite
    Bright Data (2023). BBC Datasets [Dataset]. https://brightdata.com/products/datasets/bbc
    Available download formats: .json, .csv, .xlsx
    Dataset updated
    Nov 12, 2023
    Dataset authored and provided by
    Bright Data (https://brightdata.com/)
    License

    https://brightdata.com/license

    Area covered
    Worldwide
    Description

    Unlock the full potential of BBC broadcast data with our comprehensive dataset featuring transcripts, program schedules, headlines, topics, and multimedia resources. This all-in-one dataset is designed to empower media analysts, researchers, journalists, and advocacy groups with actionable insights for media analysis, transparency studies, and editorial assessments.

    Dataset Features

    • Transcripts: Access detailed broadcast transcripts, including headlines, content, author details, and publication dates. Perfect for analyzing media framing, topic frequency, and news narratives across various programs.
    • Program Schedules: Explore program schedules with accurate timing, show names, and related metadata to track news coverage patterns and identify trends.
    • Topics and Keywords: Analyze categorized topics and keywords to understand content diversity, editorial focus, and recurring themes in news broadcasts.
    • Multimedia Content: Gain access to videos, images, and related articles linked to each broadcast for a holistic understanding of the news presentation.
    • Metadata: Includes critical data points like publication dates, last updates, content URLs, and unique IDs for easier referencing and cross-analysis.

    Customizable Subsets for Specific Needs

    Our BBC dataset is fully customizable to match your research or analytical goals. Focus on transcripts for in-depth media framing analysis, extract multimedia for content visualization studies, or dive into program schedules for broadcast trend analysis. Tailor the dataset to ensure it aligns with your objectives for maximum efficiency and relevance.

    Popular Use Cases

    • Media Analysis: Evaluate news framing, content diversity, and topic coverage to assess editorial direction and media focus.
    • Transparency Studies: Analyze journalistic standards, corrections, and retractions to assess media integrity and accountability.
    • Audience Engagement: Identify recurring topics and trends in news content to understand audience preferences and behavior.
    • Market Analysis: Track media coverage of key industries, companies, and topics to analyze public sentiment and industry relevance.
    • Journalistic Integrity: Use transcripts and metadata to evaluate adherence to reporting practices, fairness, and transparency in news coverage.
    • Research and Scholarly Studies: Leverage transcripts and multimedia to support academic studies in journalism, media criticism, and political discourse analysis.

    Whether you are evaluating transparency, conducting media criticism, or tracking broadcast trends, our BBC dataset provides you with the tools and insights needed for in-depth research and strategic analysis. Customize your access to focus on the most relevant data points for your unique needs.

  3. ‘The Lost Journalists: Dataset of journalist deaths’ analyzed by Analyst-2

    • analyst-2.ai
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com), ‘The Lost Journalists: Dataset of journalist deaths’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-the-lost-journalists-dataset-of-journalist-deaths-eb66/f982f2d4/?iid=004-940&v=presentation
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘The Lost Journalists: Dataset of journalist deaths’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/yamqwe/journalist-deathse on 13 February 2022.

    --- Dataset description provided by original source is as follows ---

    Credit for the original dataset goes to CPJ

    About this dataset

    In-the-News:

    [Image: journalist_deaths_by_year.png, https://data.world/api/journalism/dataset/journalist-deaths/file/raw/journalist_deaths_by_year.png]

    Methodology

    CPJ began compiling detailed records on journalist deaths in 1992. We apply strict journalistic standards when investigating a death. One important aspect of our research is determining whether a death was work-related. As a result, we classify deaths as "motive confirmed" or "motive unconfirmed."

    We consider a case "confirmed" only if we are reasonably certain that a journalist was murdered in direct reprisal for his or her work; was killed in crossfire during combat situations; or was killed while carrying out a dangerous assignment such as coverage of a street protest. We do not include journalists who are killed in accidents such as car or plane crashes.

    We include only confirmed cases in the statistical analyses in this database.

    When the motive is unclear, but it is possible that a journalist was killed because of his or her work, CPJ classifies the case as "unconfirmed" and continues to investigate. We regularly reclassify cases based on our ongoing research.

    Our archives include narrative capsules of all journalists killed, including the cases in which the motive is unconfirmed. In cases where the place of death is incidental to the journalist's killing, we have listed the country where the fatal attack occurred to be the place of the journalist's death (for example, in a case where a journalist is hit by shrapnel in one country and evacuated to another, where he or she dies, CPJ lists the country in which he or she was hit as the place of death).

    CPJ defines journalists as people who cover news or comment on public affairs through any media -- including in print, in photographs, on radio, on television, and online. We take up cases involving staff journalists, freelancers, stringers, bloggers, and citizen journalists. The combination of daily reporting and statistical data forms the basis of our case-driven and long-term advocacy.

    In 2003, CPJ began documenting the deaths of media support workers. We did so in recognition of the vital role these individuals play in newsgathering. These workers include translators, drivers, fixers, and administrative workers.

    Our archives include narrative capsules for media workers killed on duty. These cases are not included in our statistical analyses.

    About CPJ

    The Committee to Protect Journalists is an independent, nonprofit organization that promotes press freedom worldwide. We defend the right of journalists to report the news without fear of reprisal.

    Additional Reading
    Investigative journalism in Africa – “Walking through a minefield at midnight”
    Iraq: The deadliest war for journalists
    Being a journalist in Mexico is getting even more dangerous

    Source: Committee to Protect Journalists

    This dataset was created by Journalism, News, and Media and contains around 2000 samples, with columns including Date, Unnamed: 18, technical information, and other features such as Local/Foreign, Unnamed: 20, and more.
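
    Column names such as "Unnamed: 18" usually indicate header cells that were blank when the file was imported, so they are likely safe to drop before analysis. A minimal sketch using only the Python standard library follows; the sample row is hypothetical and is not taken from the actual file.

```python
import csv
import io


def drop_unnamed_columns(rows):
    """Remove columns whose header is blank or starts with 'Unnamed:'."""
    header = rows[0]
    keep = [i for i, name in enumerate(header)
            if name.strip() and not name.startswith("Unnamed:")]
    return [[row[i] for i in keep] for row in rows]


# Hypothetical sample; column order and values are illustrative only.
sample = "Date,Unnamed: 18,Local/Foreign,Unnamed: 20\n1992-01-01,,Local,\n"
rows = list(csv.reader(io.StringIO(sample)))
cleaned = drop_unnamed_columns(rows)
print(cleaned)
```

    Inspect any "Unnamed" column for stray values before dropping it, since the check here looks only at the header.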

    How to use this dataset

    • Analyze Coverage in relation to Taken Captive
    • Study the influence of Organization on Unnamed: 21

    Acknowledgements

    If you use this dataset in your research, please credit Journalism, News, and Media


    --- Original source retains full ownership of the source dataset ---

  4. BioPropaPhenKG Towards Monkeypox and COVID-19 Case Tracing and Analysing

    • data.niaid.nih.gov
    • zenodo.org
    Updated Apr 17, 2024
    Cite
    H. A. Medeiros, Gabriel (2024). BioPropaPhenKG Towards Monkeypox and COVID-19 Case Tracing and Analysing [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_10987742
    Dataset updated
    Apr 17, 2024
    Dataset authored and provided by
    H. A. Medeiros, Gabriel
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains:

    The BioPropaPhen ontology created from PropaPhen, being specialized with UMLS and World Knowledge Graph ontologies;

    A neo4j 4.4.3 dump file of the BioPropaPhenKG knowledge graph with WHO ground truth data about COVID-19 and Monkeypox, and enhanced presence edges between UMLS entities to World KG entities for evaluating the Description-Detection-Prediction Framework

    The datasets used for enhancing the KG are:

    Phenomenon  Dataset            Period     Documents  Source            Link
    COVID-19    Aylien             Nov-2019   8          Online News       https://aylien.com/resources/datasets/coronavirus-dataset
    COVID-19    CORD-19            Dec-2019   720        Medical Articles  https://allenai.org/data/cord-19
    COVID-19    RedditCOVID        Feb-2020   4,980      Social Media      https://paperswithcode.com/dataset/the-reddit-covid-dataset
    Monkeypox   Mined from BBC     May-2022   27         Online News
    Monkeypox   Mined from Pubmed  June-2022  36         Medical Articles
    Monkeypox   MonkeyPox2022      May-2022   33,826     Social Media      https://doi.org/10.3390/idr14060087

  5. Content analysis of fact checking and TV news during 2021: data

    • research-data.cardiff.ac.uk
    zip
    Updated Oct 30, 2024
    Cite
    Stephen Cushion (2024). Content analysis of fact checking and TV news during 2021: data [Dataset]. http://doi.org/10.17035/d.2023.0248123742
    Available download formats: zip
    Dataset updated
    Oct 30, 2024
    Dataset provided by
    Cardiff University
    Authors
    Stephen Cushion
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is based on content analysis of news media across different UK public service media organisations between April and July 2021, with a focus on the way political claims were scrutinised across broadcast news and fact-checking platforms.

    To establish a comparison between online fact-checking and broadcast news, we first constructed and examined a comprehensive sample of fact-checking items systematically retrieved from the top three UK fact-checking organisations between 20 April and 31 July 2021. This resulted in N=355 items across BBC Reality Check (N=118, 33.2%), C4 FactCheck (N=25, 7%) and the independent organisation Full Fact (N=212, 59.7%). The sample of N=355 online fact-checking items represented the base on which we built our comparative content analysis study of fact-checking and broadcast coverage of claims. The chosen timeframe reflects a period of coverage which was no longer heavily and almost entirely driven by the coronavirus pandemic, although the significant persistence of pandemic coverage provided potentially interesting case studies with regard to disinformation. Each news item published on the platforms within the sampled period (including weekends) was included for analysis.

    To establish a comparison between fact-checking and television news, a broadcast sample was constructed with the purpose of identifying potentially matching political claims featuring in both broadcast and fact-checking news. This would then allow a comparative assessment of how much scrutiny every claim received in broadcast and online coverage. To construct the broadcast sample, all stories reported by the online fact-checkers (BBC Reality Check, C4 FactCheck and Full Fact) within the sample period (N=355) were searched for on Box of Broadcasts across BBC News at Ten and Channel 4 News bulletins on the same day and across the preceding and following week of coverage. Only those TV news items which matched the story reported in the fact-checking articles were included in the sample. This was carried out in order to achieve a sample of the same stories between the two platforms of online fact-checks and televised news broadcasts. We could then establish whether the same political claim featured in both fact-checking and broadcast and finally examine how the claim was scrutinised across different platforms.
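
    The shares reported for the fact-checking sample can be checked from the item counts given in the description. The short sketch below only redoes that arithmetic; the variable names are illustrative.

```python
# Item counts per fact-checking organisation, as stated in the description.
counts = {"BBC Reality Check": 118, "C4 FactCheck": 25, "Full Fact": 212}

total = sum(counts.values())
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

print(total)
for name, share in shares.items():
    print(f"{name}: {share}%")
```

    The counts add up to the stated N=355, and the rounded shares agree with the reported 33.2%, 7% and 59.7%.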

  6. Country-wise weather data for COVID-19

    • kaggle.com
    Updated Apr 2, 2020
    Cite
    Sudhir Kakumanu (2020). Country-wise weather data for covid19 [Dataset]. https://www.kaggle.com/ksudhir/weather-data-countries-covid19/code
    Croissant: a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 2, 2020
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Sudhir Kakumanu
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Context

    The COVID-19 coronavirus pandemic has over 1 million cases worldwide. This dataset was created in an attempt to uncover whether there is a correlation between country-wise weather parameters and the number of cases growing day by day.

    Many questions have been raised about the effects of seasonality on SARS-CoV-2.

    According to the WHO press conference transcript of 5 March 2020, Dr Maria van Kerkhove answered:

    "So we’ve had some questions previously about what this virus will do in different climates, in different temperatures. We have no reason to believe that this virus would behave differently in different temperatures, which is why we want aggressive action in all countries to make sure that we prevent onward transmission, and that it’s taken seriously in every country. But this is something that will be of interest. We have the... In the northern hemisphere we have the flu season, which was ending fairly soon, and in the southern hemisphere we’ll have the flu season starting. And so it will be interesting to see what will happen in the northern hemisphere and the southern hemisphere. But to look at seasonality you need to look at patterns over time, and we do need some of that time to be able to see what happens. So it’s important that we aggressively look for cases, and so that we can understand the extent of infection and how the virus behaves in different populations."

    Some believe temperature will play a role in the outbreak; either way, the subject is worth investigating. A few studies and reports on the question have come from Harvard CSPH, BBC, Bloomberg, and the Centre for Evidence-Based Medicine.

    Content

    Basic weather parameters, such as min/max temperature and humidity, captured since 1/22/2020. Each country has three rows defining the weather parameters over time. The structure is kept in line with the data repository by Johns Hopkins CSSE.
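
    A layout with three rows per country and one column per date is easier to join with case counts once it is reshaped to one record per country, parameter, and date. The sketch below assumes a hypothetical header: "Country/Region" follows the Johns Hopkins CSSE convention, while "weather_param" and all sample values are invented for illustration and may not match the actual file.

```python
import csv
import io

# Hypothetical sample in a wide, one-column-per-date layout.
sample = """Country/Region,weather_param,1/22/20,1/23/20
Exampleland,min_temp,1.0,2.0
Exampleland,max_temp,8.0,9.0
Exampleland,humidity,70,72
"""

ID_COLUMNS = ("Country/Region", "weather_param")


def wide_to_long(text):
    """Turn one column per date into one record per (country, parameter, date)."""
    records = []
    for row in csv.DictReader(io.StringIO(text)):
        for column, value in row.items():
            if column in ID_COLUMNS:
                continue  # identifier columns are copied, not unpivoted
            records.append({
                "country": row["Country/Region"],
                "param": row["weather_param"],
                "date": column,
                "value": float(value),
            })
    return records


records = wide_to_long(sample)
print(len(records), records[0])
```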

    Acknowledgements

    Country names are picked from: https://github.com/CSSEGISandData/COVID-19

    Inspiration

    https://github.com/kakumanu-sudhir/covid19/tree/master/weather_data_extraction

    The data begins with the first reported coronavirus case on Jan. 21, 2020. I plan to publish regular updates (twice weekly until week 23) to the data in this repository.

  7. Attitudes Toward RIAS (June 1954/I) - Dataset - B2FIND

    • b2find.eudat.eu
    Updated Nov 2, 2023
    Cite
    (2023). Attitudes Toward RIAS (June 1954/I) - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/f480d72e-3860-5ca2-b2b5-38508314cee9
    Dataset updated
    Nov 2, 2023
    Description

    Attitudes to the radio station RIAS.

    Topics: listening to the radio in general; listening to West German and East German stations; quality of reception of the station; improvement in reception quality; preferred station; listening to RIAS; listening to the radio on weekdays and Sundays; amount of listening to the radio per week; judgement on the program of RIAS; reasons for the judgement on the program; RIAS as information station; substance of news broadcasts of RIAS; reports about weaknesses and strengths of the USA by RIAS; denying weaknesses of the USA by RIAS; listening to English-language stations; frequency of listening to English-language stations; knowledge and listening to AFN; quality of reception of AFN; listening to AFN in the GDR; preferred station in case of a political crisis; listening to certain programs in the last few years and earlier; reliability of news of the BBC and the Voice of America in comparison; times of listening to the Voice of America; listening to certain programs of the Voice of America; frequency of listening to the Voice of America; change of reporting of the Voice of America; evaluation of change of the Voice of America; changes of the Voice of America regarding propaganda; importance of Voice of America; benefit of broadcasts of the Voice of America; better understanding of American policies from the Voice of America; better understanding of American culture from the Voice of America; change of personal views about world politics from the Voice of America; substance of the news of the Voice of America; report about weaknesses and strengths of the USA by the Voice of America; denial of weaknesses of the USA by the Voice of America; missing the Voice of America at termination of the program.

    Demography: age; occupation; school education; sex; state; city size; number of stays in West Berlin; FDJ membership.

  8. Attitudes Toward RIAS (June 1954/I) - Dataset - B2FIND

    • b2find.eudat.eu
    Updated Jun 12, 2023
    Cite
    (2023). Attitudes Toward RIAS (June 1954/I) - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/0266d1c5-dac7-5ff7-a79b-bd3295062099
    Dataset updated
    Jun 12, 2023
    Description

    Attitudes to the radio station RIAS.

    Topics: listening to the radio in general; listening to West German and East German stations; quality of reception of the station; improvement in reception quality; preferred station; listening to RIAS; listening to the radio on weekdays and Sundays; amount of listening to the radio per week; judgement on the program of RIAS; reasons for the judgement on the program; RIAS as information station; substance of news broadcasts of RIAS; reports about weaknesses and strengths of the USA by RIAS; denying weaknesses of the USA by RIAS; listening to English-language stations; frequency of listening to English-language stations; knowledge and listening to AFN; quality of reception of AFN; listening to AFN in the GDR; preferred station in case of a political crisis; listening to certain programs in the last few years and earlier; reliability of news of the BBC and the Voice of America in comparison; times of listening to the Voice of America; listening to certain programs of the Voice of America; frequency of listening to the Voice of America; change of reporting of the Voice of America; evaluation of change of the Voice of America; changes of the Voice of America regarding propaganda; importance of Voice of America; benefit of broadcasts of the Voice of America; better understanding of American policies from the Voice of America; better understanding of American culture from the Voice of America; change of personal views about world politics from the Voice of America; substance of the news of the Voice of America; report about weaknesses and strengths of the USA by the Voice of America; denial of weaknesses of the USA by the Voice of America; missing the Voice of America at termination of the program.

    Demography: age; occupation; school education; sex; state; city size; number of stays in West Berlin; FDJ membership.

  9. Television framing of the 2014 Scottish independence referendum - Part 3:...

    • b2find.eudat.eu
    Updated Jan 23, 2015
    Cite
    (2015). Television framing of the 2014 Scottish independence referendum - Part 3: Coding of news sources - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/8e100e13-9e03-5c47-84a2-20282277e3de
    Dataset updated
    Jan 23, 2015
    Area covered
    Scotland
    Description

    This dataset contains the coding of the sources that appeared or were openly referenced in all news items about the 2014 Scottish independence referendum which were broadcast on BBC Reporting Scotland between 18 August and 18 September 2014. The file records the name of each source, the duration of their appearance or quotation, their gender, the side they supported in the referendum, the source category they belonged in (elite official; expert; non-elite official; unofficial; confidential; unaccounted), whether they were interviewed or paraphrased; whether they were identified by name or in generic terms; whether they were used once or multiple times in the same item; and whether they proposed new arguments or responded to someone else's. The news programmes themselves are available from the broadcaster. These data complement the interview dataset and the frame analysis dataset by showing which sources were used in the news coverage, and were therefore given explicitly the opportunity to promote their own frames of what the referendum was about. The other two datasets (see Related resources below) explore which frames were present in the coverage and how these frames emerged based on the experiences of broadcasters and their political and civil society sources.

    On 18 September 2014, the Scottish electorate will be called to answer a fundamental question about the future of the UK and Scotland: the decision of whether Scotland will become an independent state or remain a part of the UK will have an impact not only on the relationship between the British nations but also on other parts of Europe with similar concerns. Yet, as is the case with any contested issue, the definition of what this referendum is about will be negotiated between political and social groups, debated in the media and deliberated by voters before making their decision. Is the referendum a competition between two opponents fighting for the vote? Is it a matter of identity (shared or distinctive)? Is it a matter of economic survival and growth?

    This research will examine how the 2014 Scottish independence referendum campaign is framed in the news coverage of the two main television channels catering for audiences in Central Scotland, BBC Scotland and STV. The importance of television as a trusted source of news on political issues is constantly reaffirmed by surveys (Ofcom, 2013, Eurobarometer, 2012) and therefore what television says about a major political event is significant. The study will focus on Scottish news and current affairs coverage referring to the referendum in the final month of the campaign, create an original set of frames emerging from the coverage and measure which of them were more prominent. The project will also use structured interviews with political editors, heads of news and current affairs, political and civil society actors, to discuss how these representations were shaped in the interaction between journalists, media organisations and their sources. The project will contribute to public analysis of the news coverage of the referendum in the aftermath of the event and create opportunities for stakeholders to discuss how broadcasting contributes to the democratic process, through the way it reports on campaigns.

    All the items specified above were watched and coded for the sources that appeared or were mentioned during each programme. The categories into which sources were classified were as follows.

    • Elite official sources: political and state institutions, official political campaigns, major corporate, business and economic organisations, major NGOs, celebrities, royalty, news agencies and other news media.
    • Non-elite official sources: smaller non-profit and non-governmental organisations (charities, voluntary organisations, associations, societies, communities), interest, activist and pressure groups, trade unions, small businesses.
    • Experts: academics and scientists, observers and specialists, analysts, think tanks, former politicians, former public officials.
    • Unofficial sources: ordinary people, voters, workers (lower level staff), vox populi, survey respondents, protesters, demonstrators, rioters, hecklers, observers and participants in unusual activities.
    • Confidential sources: unnamed, e.g. according to 'well-informed sources'.
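
    The coding scheme described for this dataset maps naturally onto one record per source. The sketch below is one possible representation and not the dataset's actual file layout: the field names, the assumption that duration is stored in seconds, and the sample entries are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass

# Source categories as listed in the dataset description.
CATEGORIES = {"elite official", "expert", "non-elite official",
              "unofficial", "confidential", "unaccounted"}


@dataclass
class CodedSource:
    name: str
    duration_seconds: float  # unit is an assumption
    gender: str
    side: str
    category: str
    interviewed: bool            # False means paraphrased
    named: bool                  # False means identified in generic terms
    used_multiple_times: bool
    proposed_new_argument: bool  # False means responded to someone else

    def __post_init__(self):
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown source category: {self.category}")


def airtime_by_category(sources):
    """Sum appearance or quotation time per source category."""
    totals = defaultdict(float)
    for source in sources:
        totals[source.category] += source.duration_seconds
    return dict(totals)


# Hypothetical entries for illustration only.
sources = [
    CodedSource("Source A", 30.0, "female", "yes", "expert", True, True, False, True),
    CodedSource("Source B", 12.5, "male", "no", "expert", False, True, False, False),
    CodedSource("Source C", 8.0, "female", "yes", "unofficial", True, False, False, True),
]
totals = airtime_by_category(sources)
print(totals)
```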

  10. Content and Framing Study of United Kingdom Media Coverage of the Iraq War,...

    • b2find.eudat.eu
    Updated Oct 21, 2023
    Cite
    (2023). Content and Framing Study of United Kingdom Media Coverage of the Iraq War, 2003 - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/a0626780-7179-5d64-92f6-46542c30e356
    Dataset updated
    Oct 21, 2023
    Area covered
    Iraq, United Kingdom
    Description

    Abstract copyright UK Data Service and data collection copyright owner. The purpose of this project was to evaluate media performance during the 2003 Iraq War. The war provided a fascinating case study, creating unprecedented levels of popular and political dissent, while questions surrounding media coverage generated accusations of media bias. Through analysing the success of media at maintaining autonomy and balance, this project provided research-based evidence to inform on-going public and political debates regarding the media's role during this conflict. A combined content and framing analysis of both UK TV news coverage and UK press enabled the researchers to assess, in great detail, how media reported the war. The breadth and depth of analysis far exceeds other equivalent studies. The analysis included four principal TV news programmes (from BBC, ITV, Sky News and Channel Four) and seven national daily newspapers and their Sunday equivalents (Daily Telegraph, The Times, The Guardian/The Observer, The Independent, The Daily Mail, The Mirror, The Sun/News of the World), thus enabling a thorough assessment of the quality of the UK public sphere during the conflict. With the story as the unit of analysis, media reports were systematically analysed in multiple ways, including documentation of story length, format (from a range of types of newspaper story or TV news report), use of new technology (e.g. video-phone), subject matter, sources quoted and cited, use of visuals or photographs, etc. Reports were also assessed for their tone toward the main actors in the conflict whilst a detailed framing analysis provided measures of more subtle forms of media bias. 
    A key aim of the research was to identify the contours of framing in British TV and newspaper news of the war, uncovering the range, autonomy and boundaries to debate across media outlets, the extent to which news coverage reflected elite sources as well as dissenting voices, and the relative salience of justifications for the war.

  11. Television framing of the 2014 Scottish independence referendum - Part 2:...

    • b2find.eudat.eu
    Updated Oct 21, 2023
    Cite
    (2023). Television framing of the 2014 Scottish independence referendum - Part 2: Coding of frames in television programmes - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/8f07beed-6a64-5340-9b45-5c487e148b13
    Dataset updated
    Oct 21, 2023
    Area covered
    Scotland
    Description

    This dataset contains the coding of the frames that appeared in all news and current affairs items about the 2014 Scottish independence referendum which were produced for a Scottish audience (i.e. excluding the UK-wide coverage of the referendum) and were broadcast on BBC Scotland and STV between 18 August and 18 September 2014. The file records the date, duration, channel, and type of each item in this coverage and whether or not the following frames were present: policy, strategic game, social justice, divorce, democratic achievement, constitutional change, national division, self determination, and national identity. The programmes themselves are available from the respective broadcasters. These data complement the interview data (see Related resources below), which also form part of this project, as the quantitative data show which frames emerged in the television coverage and the interview data what factors may have influenced their creation. These data also complement the sources analysis data (see Related resources below) as those demonstrate which actors were used as sources in the news and, therefore, which actors were explicitly given access to television space to promote their own frames of what the referendum was about.

    On 18 September 2014, the Scottish electorate will be called to answer a fundamental question about the future of the UK and Scotland: the decision of whether Scotland will become an independent state or remain a part of the UK will have an impact not only on the relationship between the British nations but also on other parts of Europe with similar concerns. Yet, as is the case with any contested issue, the definition of what this referendum is about will be negotiated between political and social groups, debated in the media and deliberated by voters before making their decision. Is the referendum a competition between two opponents fighting for the vote? Is it a matter of identity (shared or distinctive)? Is it a matter of economic survival and growth? This research will examine how the 2014 Scottish independence referendum campaign is framed in the news coverage of the two main television channels catering for audiences in Central Scotland, BBC Scotland and STV. The importance of television as a trusted source of news on political issues is constantly reaffirmed by surveys (Ofcom, 2013, Eurobarometer, 2012) and therefore what television says about a major political event is significant. The study will focus on Scottish news and current affairs coverage referring to the referendum in the final month of the campaign, create an original set of frames emerging from the coverage and measure which of them were more prominent. The project will also use structured interviews with political editors, heads of news and current affairs, political and civil society actors, to discuss how these representations were shaped in the interaction between journalists, media organisations and their sources. The project will contribute to public analysis of the news coverage of the referendum in the aftermath of the event and create opportunities for stakeholders to discuss how broadcasting contributes to the democratic process, through the way it reports on campaigns.

    All programmes specified in the abstract were watched and coded for presence or absence of a number of frames, based on the following indicators.

    • Indicators of game frame: emphasis on political strategy; war, game and horse-race metaphors; emphasis on who is winning or losing; reports of how the two sides are doing in polls; analyses of politicians’ performance.

    • Indicators of policy frame: focus on policy problems, politicians’ proposals for their solution and their implications for the public.

    • Indicators of identity frame: references to Scottish distinctiveness; references to the common features and history that Scots share with the rest of the UK.

    • Indicators of self-determination frame: references to Scotland making decisions separately from the rest of the UK (not specifying what decisions); references to Scotland getting the governments it votes for.

    • Indicators of divorce frame: marriage, relationship and/or breaking up metaphors; representation of Scotland and England as human partners or friends falling out.

    • Indicators of national division frame: reports on current division in Scotland, emphasis on conflictive nature of the referendum.

    • Indicators of democratic achievement frame: references to the referendum as a major achievement for democracy, reports on high involvement of citizens in debate, reports on high turnout, praise for the civility with which the referendum was carried out.

    • Indicators of social justice frame: references to Scotland becoming a more fair society; general references to resolving social injustices.

    • Indicators of constitutional change frame: emphasis on achieving more powers for Scotland, references to changing the constitutional status of Scotland; reports on proposals for a federal UK, devo-max.
