A survey exploring perspectives on local news and the coronavirus in the United States in April 2020 revealed that 66 percent of Demcorats were concerned about that their local news organization would be affected by the economic downturn caused by the pandemic. Political party identification greatly altered attitudes to this issue, with just nine percent of Republicans expressing a great deal of concern that their local outlet would be impacted.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product (GDP) in the United States was worth 27720.71 billion US dollars in 2023, according to official data from the World Bank. The GDP value of the United States represents 26.29 percent of the world economy. This dataset provides - United States GDP - actual values, historical data, forecast, chart, statistics, economic calendar and news.
This data package includes the underlying data and files to replicate the calculations, charts, and tables presented in Reality Check for the Global Economy, PIIE Briefing 16-3.
If you use the data, please cite as: Acalin, Julien, Olivier Blanchard, Monica de Bolle, José De Gregorio, Caroline Freund, Joseph E. Gagnon, Nicholas R. Lardy, Adam S. Posen, David J. Stockton, and Nicolas Véron. (2016). Reality Check for the Global Economy. PIIE Briefing 16-3. Peterson Institute for International Economics.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Discover how data center investments, driven by AI advancements, are projected to boost U.S. economic growth, with major tech companies leading the charge.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Gross Domestic Product (GDP) in the United States expanded 2.50 percent in the fourth quarter of 2024 over the same quarter of the previous year. This dataset provides the latest reported value for - United States GDP Annual Growth Rate - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
https://fred.stlouisfed.org/legal/#copyright-citation-requiredhttps://fred.stlouisfed.org/legal/#copyright-citation-required
Graph and download economic data for St. Louis Fed Economic News Index: Real GDP Nowcast (STLENI) from Q2 2013 to Q1 2025 about nowcast, projection, real, GDP, rate, indexes, and USA.
Live Briefs Investor – US Covering thousands of listed securities and events across 80 news categories, Live Briefs Investor US is specifically designed to keep individual investors and active traders on top of breaking news that is likely to affect their portfolios.
Most of the largest and most respected retail and self-directed brokerage firms in the North America rely on MT Newswires to provide their clients with complete coverage of the financial markets. The Investor service includes timely and insightful commentary on equities, commodities, ETFs, economics, forex, options and fixed income assets throughout the day (6:30 am to 6:30 pm EST).
Every story is ticker-tagged and category-coded to allow for seamless platform integration. US Equities – significant events affecting individual public companies in the US: After-hours and pre-market news, trading activity and technical price level indications; Earnings estimate change alerts; Analyst Rating Changes- the most comprehensive view and coverage of rating changes available anywhere; ETF Power Play – daily trends in ETF trading activity; Mini and detailed sector summaries – pre-market, mid-day, and closing; Market Chatter – real-time coverage of trading desk rumors and breaking news; Zero noise: Only premium, original news and event analysis. Never any fillers (press releases, non-market related news, etc.).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Introduction
There are several works based on Natural Language Processing on newspaper reports. Mining opinions from headlines [ 1 ] using Standford NLP and SVM by Rameshbhaiet. Al.compared several algorithms on a small and large dataset. Rubinet. al., in their paper [ 2 ], created a mechanism to differentiate fake news from real ones by building a set of characteristics of news according to their types. The purpose was to contribute to the low resource data available for training machine learning algorithms. Doumitet. al.in [ 3 ] have implemented LDA, a topic modeling approach to study bias present in online news media.
However, there are not many NLP research invested in studying COVID-19. Most applications include classification of chest X-rays and CT-scans to detect presence of pneumonia in lungs [ 4 ], a consequence of the virus. Other research areas include studying the genome sequence of the virus[ 5 ][ 6 ][ 7 ] and replicating its structure to fight and find a vaccine. This research is crucial in battling the pandemic. The few NLP based research publications are sentiment classification of online tweets by Samuel et el [ 8 ] to understand fear persisting in people due to the virus. Similar work has been done using the LSTM network to classify sentiments from online discussion forums by Jelodaret. al.[ 9 ]. NKK dataset is the first study on a comparatively larger dataset of a newspaper report on COVID-19, which contributed to the virus’s awareness to the best of our knowledge.
2 Data-set Introduction
2.1 Data Collection
We accumulated 1000 online newspaper report from United States of America (USA) on COVID-19. The newspaper includes The Washington Post (USA) and StarTribune (USA). We have named it as “Covid-News-USA-NNK”. We also accumulated 50 online newspaper report from Bangladesh on the issue and named it “Covid-News-BD-NNK”. The newspaper includes The Daily Star (BD) and Prothom Alo (BD). All these newspapers are from the top provider and top read in the respective countries. The collection was done manually by 10 human data-collectors of age group 23- with university degrees. This approach was suitable compared to automation to ensure the news were highly relevant to the subject. The newspaper online sites had dynamic content with advertisements in no particular order. Therefore there were high chances of online scrappers to collect inaccurate news reports. One of the challenges while collecting the data is the requirement of subscription. Each newspaper required $1 per subscriptions. Some criteria in collecting the news reports provided as guideline to the human data-collectors were as follows:
The headline must have one or more words directly or indirectly related to COVID-19.
The content of each news must have 5 or more keywords directly or indirectly related to COVID-19.
The genre of the news can be anything as long as it is relevant to the topic. Political, social, economical genres are to be more prioritized.
Avoid taking duplicate reports.
Maintain a time frame for the above mentioned newspapers.
To collect these data we used a google form for USA and BD. We have two human editor to go through each entry to check any spam or troll entry.
2.2 Data Pre-processing and Statistics
Some pre-processing steps performed on the newspaper report dataset are as follows:
Remove hyperlinks.
Remove non-English alphanumeric characters.
Remove stop words.
Lemmatize text.
While more pre-processing could have been applied, we tried to keep the data as much unchanged as possible since changing sentence structures could result us in valuable information loss. While this was done with help of a script, we also assigned same human collectors to cross check for any presence of the above mentioned criteria.
The primary data statistics of the two dataset are shown in Table 1 and 2.
Table 1: Covid-News-USA-NNK data statistics
No of words per headline
7 to 20
No of words per body content
150 to 2100
Table 2: Covid-News-BD-NNK data statistics No of words per headline
10 to 20
No of words per body content
100 to 1500
2.3 Dataset Repository
We used GitHub as our primary data repository in account name NKK^1. Here, we created two repositories USA-NKK^2 and BD-NNK^3. The dataset is available in both CSV and JSON format. We are regularly updating the CSV files and regenerating JSON using a py script. We provided a python script file for essential operation. We welcome all outside collaboration to enrich the dataset.
3 Literature Review
Natural Language Processing (NLP) deals with text (also known as categorical) data in computer science, utilizing numerous diverse methods like one-hot encoding, word embedding, etc., that transform text to machine language, which can be fed to multiple machine learning and deep learning algorithms.
Some well-known applications of NLP includes fraud detection on online media sites[ 10 ], using authorship attribution in fallback authentication systems[ 11 ], intelligent conversational agents or chatbots[ 12 ] and machine translations used by Google Translate[ 13 ]. While these are all downstream tasks, several exciting developments have been made in the algorithm solely for Natural Language Processing tasks. The two most trending ones are BERT[ 14 ], which uses bidirectional encoder-decoder architecture to create the transformer model, that can do near-perfect classification tasks and next-word predictions for next generations, and GPT-3 models released by OpenAI[ 15 ] that can generate texts almost human-like. However, these are all pre-trained models since they carry huge computation cost. Information Extraction is a generalized concept of retrieving information from a dataset. Information extraction from an image could be retrieving vital feature spaces or targeted portions of an image; information extraction from speech could be retrieving information about names, places, etc[ 16 ]. Information extraction in texts could be identifying named entities and locations or essential data. Topic modeling is a sub-task of NLP and also a process of information extraction. It clusters words and phrases of the same context together into groups. Topic modeling is an unsupervised learning method that gives us a brief idea about a set of text. One commonly used topic modeling is Latent Dirichlet Allocation or LDA[17].
Keyword extraction is a process of information extraction and sub-task of NLP to extract essential words and phrases from a text. TextRank [ 18 ] is an efficient keyword extraction technique that uses graphs to calculate the weight of each word and pick the words with more weight to it.
Word clouds are a great visualization technique to understand the overall ’talk of the topic’. The clustered words give us a quick understanding of the content.
4 Our experiments and Result analysis
We used the wordcloud library^4 to create the word clouds. Figure 1 and 3 presents the word cloud of Covid-News-USA- NNK dataset by month from February to May. From the figures 1,2,3, we can point few information:
In February, both the news paper have talked about China and source of the outbreak.
StarTribune emphasized on Minnesota as the most concerned state. In April, it seemed to have been concerned more.
Both the newspaper talked about the virus impacting the economy, i.e, bank, elections, administrations, markets.
Washington Post discussed global issues more than StarTribune.
StarTribune in February mentioned the first precautionary measurement: wearing masks, and the uncontrollable spread of the virus throughout the nation.
While both the newspaper mentioned the outbreak in China in February, the weight of the spread in the United States are more highlighted through out March till May, displaying the critical impact caused by the virus.
We used a script to extract all numbers related to certain keywords like ’Deaths’, ’Infected’, ’Died’ , ’Infections’, ’Quarantined’, Lock-down’, ’Diagnosed’ etc from the news reports and created a number of cases for both the newspaper. Figure 4 shows the statistics of this series. From this extraction technique, we can observe that April was the peak month for the covid cases as it gradually rose from February. Both the newspaper clearly shows us that the rise in covid cases from February to March was slower than the rise from March to April. This is an important indicator of possible recklessness in preparations to battle the virus. However, the steep fall from April to May also shows the positive response against the attack. We used Vader Sentiment Analysis to extract sentiment of the headlines and the body. On average, the sentiments were from -0.5 to -0.9. Vader Sentiment scale ranges from -1(highly negative to 1(highly positive). There were some cases
where the sentiment scores of the headline and body contradicted each other,i.e., the sentiment of the headline was negative but the sentiment of the body was slightly positive. Overall, sentiment analysis can assist us sort the most concerning (most negative) news from the positive ones, from which we can learn more about the indicators related to COVID-19 and the serious impact caused by it. Moreover, sentiment analysis can also provide us information about how a state or country is reacting to the pandemic. We used PageRank algorithm to extract keywords from headlines as well as the body content. PageRank efficiently highlights important relevant keywords in the text. Some frequently occurring important keywords extracted from both the datasets are: ’China’, Government’, ’Masks’, ’Economy’, ’Crisis’, ’Theft’ , ’Stock market’ , ’Jobs’ , ’Election’, ’Missteps’, ’Health’, ’Response’. Keywords extraction acts as a filter allowing quick searches for indicators in case of locating situations of the economy,
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Government spending in the United States was last recorded at 34.4 percent of GDP in 2023 . This dataset provides - United States Government Spending To Gdp- actual values, historical data, forecast, chart, statistics, economic calendar and news.
The statistic shows GDP per capita in the United Kingdom from 1987 to 2020, with projections up until 2029. In 2020, GDP per capita in the United Kingdom was at around 40,230.55 US dollars. The same year, the total UK population amounted to about 67.26 million people. The United Kingdom is among the leading countries in a world GDP ranking.Falling unemployment in a time of recessionGDP is a useful indicator when it comes to measuring the state of a nation’s economy. GDP is the market value of all final goods and services produced within a country in a given period of time, usually a year. GDP per capita equals exactly the GDI (gross domestic income) per capita and is not a measure of an individual’s personal income.As can be seen clearly in the statistic, gross domestic product (GDP) per capita in the United Kingdom is beginning to increase, albeit not to pre-recession levels. The UK is beginning to see signs of an economic recovery, though as of yet it remains unclear what sort of recovery this is. Questions have been raised as to whether the growth being seen is the right sort of growth for a well balanced recovery across the necessary sectors. An interesting oddity occurred in the United Kingdom for nine months in 2012, which saw a decreasing unemployment occurring at the same time as dip in nationwide economic productivity. This seems like good - if not unusual - news, but could be indicative of people entering part-time employment. It could also suggest that labor productivity is falling, meaning that the UK would be less competitive as a nation. The figures continue to rise, however, with an increase in employment in the private sector. With the rate of inflation in the UK impacting everyone’s daily lives, it is becoming increasingly difficult for vulnerable groups to maintain a decent standard of living.
In 2023, the U.S. Consumer Price Index was 309.42, and is projected to increase to 352.27 by 2029. The base period was 1982-84. The monthly CPI for all urban consumers in the U.S. can be accessed here. After a time of high inflation, the U.S. inflation rateis projected fall to two percent by 2027. United States Consumer Price Index ForecastIt is projected that the CPI will continue to rise year over year, reaching 325.6 in 2027. The Consumer Price Index of all urban consumers in previous years was lower, and has risen every year since 1992, except in 2009, when the CPI went from 215.30 in 2008 to 214.54 in 2009. The monthly unadjusted Consumer Price Index was 296.17 for the month of August in 2022. The U.S. CPI measures changes in the price of consumer goods and services purchased by households and is thought to reflect inflation in the U.S. as well as the health of the economy. The U.S. Bureau of Labor Statistics calculates the CPI and defines it as, "a measure of the average change over time in the prices paid by urban consumers for a market basket of consumer goods and services." The BLS records the price of thousands of goods and services month by month. They consider goods and services within eight main categories: food and beverage, housing, apparel, transportation, medical care, recreation, education, and other goods and services. They aggregate the data collected in order to compare how much it would cost a consumer to buy the same market basket of goods and services within one month or one year compared with the previous month or year. Given that the CPI is used to calculate U.S. inflation, the CPI influences the annual adjustments of many financial institutions in the United States, both private and public. Wages, social security payments, and pensions are all affected by the CPI.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The United States recorded a Government Debt to GDP of 122.30 percent of the country's Gross Domestic Product in 2023. This dataset provides - United States Government Debt To GDP - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Retail Sales in the United States increased 0.20 percent in February of 2025 over the previous month. This dataset provides - U.S. December Retail Sales Increased More Than Forecast - actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Household Saving Rate in the United States increased to 4.60 percent in January from 3.50 percent in December of 2024. This dataset provides - United States Personal Savings Rate - actual values, historical data, forecast, chart, statistics, economic calendar and news.
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
We have successfully extracted a comprehensive news dataset from CNBC, covering not only financial updates but also an extensive range of news categories relevant to diverse audiences in Europe, the US, and the UK. This dataset includes over 500,000 records, meticulously structured in JSON format for seamless integration and analysis.
This extensive extraction spans multiple segments, such as:
Each record in the dataset is enriched with metadata tags, enabling precise filtering by region, sector, topic, and publication date.
The comprehensive news dataset provides real-time insights into global developments, corporate strategies, leadership changes, and sector-specific trends. Designed for media analysts, research firms, and businesses, it empowers users to perform:
Additionally, the JSON format ensures easy integration with analytics platforms for advanced processing.
Looking for a rich repository of structured news data? Visit our news dataset collection to explore additional offerings tailored to your analysis needs.
To get a preview, check out the CSV sample of the CNBC economy articles dataset.
By November 2025, it is projected that there is a probability of 33.56 percent that the United States will fall into another economic recession. This reflects a significant decrease from the projection of the preceding month.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Employment Rate in the United States decreased to 59.90 percent in February from 60.10 percent in January of 2025. This dataset provides - United States Employment Rate- actual values, historical data, forecast, chart, statistics, economic calendar and news.
There is substantial evidence that voters’ choices are shaped by assessments of the state of the economy and that these assessments, in turn, are influenced by the news. But how does the economic news track the welfare of different income groups in an era of rising inequality? Whose economy does the news cover? Drawing on a large new dataset of U.S. news content, we demonstrate that the tone of the economic news strongly and disproportionately tracks the fortunes of the richest households, with little sensitivity to income changes among the non-rich. Further, we present evidence that this pro-rich bias emerges not from pro-rich journalistic preferences but, rather, from the interaction of the media’s focus on economic aggregates with structural features of the relationship between economic growth and distribution. The findings yield a novel explanation of distributionally perverse electoral patterns and demonstrate how distributional biases in the economy condition economic accountability.
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Real Gross Domestic Product: Private Goods-Producing Industries in Newport News City, VA (REALGDPGOODS51700) from 2002 to 2023 about Newport News, Independent City, goods-producing, VA, private, real, industry, GDP, and USA.
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Real Gross Domestic Product: Private Services-Providing Industries in Newport News City, VA (REALGDPSERV51700) from 2002 to 2023 about Newport News, Independent City, services-providing, VA, private, real, industry, GDP, and USA.
A survey exploring perspectives on local news and the coronavirus in the United States in April 2020 revealed that 66 percent of Demcorats were concerned about that their local news organization would be affected by the economic downturn caused by the pandemic. Political party identification greatly altered attitudes to this issue, with just nine percent of Republicans expressing a great deal of concern that their local outlet would be impacted.