We asked U.S. consumers about "ePaper / online news website usage by brand" and found that "The New York Times" takes the top spot, while "Sfchronicle.com" is at the other end of the ranking. These results are based on a representative online survey conducted in 2024 among 3,605 consumers in the United States.
Traffic analytics, rankings, and competitive metrics for nytimes.com as of October 2025
https://semrush.ebundletools.com/company/legal/terms-of-service/
nytimes.com is ranked #24 in US with 550.63M Traffic. Categories: Newspapers. Learn more about website traffic, market share, and more!
In 2024, the average weekday print circulation of The New York Times was approximately ******* copies, less than half the figure recorded in 2014. In that year, the company ceased publishing its figures based on weekday circulation for print, online, and other digital platforms, and published only its print circulation.
The New York Times
The New York Times was founded in 1851 and has been a household name in the United States for decades. The newspaper has adapted well to changes in the media industry, and between the final quarters of 2014 and 2020, paid subscribers to The New York Times' digital-only news product increased from *** thousand to over ************. The New York Times is also one of the world's leading podcast publishers, with unique streams and downloads of the company's podcasts reaching tens and sometimes even hundreds of millions per month.
Popularity and reliability
As one of the most popular news websites in the United States, the NYT has been known to achieve ** million unique monthly visitors, outperforming the likes of NBC News, The Washington Post, and The Guardian. That said, like many news publications, The New York Times has been the subject of controversy over the years. From accusations of liberal bias to its hiring practices, the newspaper has faced challenges regarding not only its published content but also its employees. In spite of this, just ** percent of respondents to a survey seriously doubted the credibility of The New York Times, with most finding the publication to be a reliable source.
https://semrush.ebundletools.com/company/legal/terms-of-service/
newyorktimes.com is ranked #94175 in US with 107.19K Traffic. Categories: . Learn more about website traffic, market share, and more!
The dataset was downloaded for a hackathon in which the task was to determine the sentiment of the news.
The columns "year", "title", and "excerpt" contain the year of publication, the title, and an excerpt from the news item.
Data provided by The New York Times https://developer.nytimes.com/
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Code:
Packet_Features_Generator.py & Features.py
To run this code:
pkt_features.py [-h] -i TXTFILE [-x X] [-y Y] [-z Z] [-ml] [-s S] -j
-h, --help    show this help message and exit
-i TXTFILE    input text file
-x X          Add first X number of total packets as features.
-y Y          Add first Y number of negative packets as features.
-z Z          Add first Z number of positive packets as features.
-ml           Output to text file all websites in the format of websiteNumber1,feature1,feature2,...
-s S          Generate samples using size s.
-j
Purpose:
Turns a text file containing lists of incoming and outgoing network packet sizes into separate website objects with associated features.
Uses Features.py to calculate the features.
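As a rough illustration of what the -x, -y and -z options describe, the sketch below builds a feature vector from the first X packets overall, the first Y negative packets, and the first Z positive packets. The function name first_n_features and the zero padding for short captures are assumptions made for this example; they are not taken from Features.py.

```python
from typing import List


def first_n_features(packets: List[int], x: int = 0, y: int = 0, z: int = 0) -> List[int]:
    """Build a fixed-length feature vector from signed packet sizes.

    Hypothetical helper, not the authors' Features.py: takes the first x
    packets overall, the first y negative packets, and the first z positive
    packets. Padding with 0 for short captures is an assumption.
    """
    def take(values: List[int], n: int) -> List[int]:
        head = values[:n]
        return head + [0] * (n - len(head))

    negative = [p for p in packets if p < 0]
    positive = [p for p in packets if p > 0]
    return take(packets, x) + take(negative, y) + take(positive, z)


example = first_n_features([120, -1500, -1500, 80, -40], x=2, y=2, z=3)
print(example)  # [120, -1500, -1500, -1500, 120, 80, 0]
```

Padding keeps every sample the same length, which a classifier needs, but the real script may handle short captures differently.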
startMachineLearning.sh & machineLearning.py
To run this code:
bash startMachineLearning.sh
This code then runs machineLearning.py in a tmux session with the necessary file paths and flags.
Options (to be edited within this file):
--evaluate-only to test 5-fold cross-validation accuracy
--test-scaling-normalization to test 6 different combinations of scalers and normalizers
Note: once the best combination is determined, it should be added to the data_preprocessing function in machineLearning.py for future use
--grid-search to find the best grid search hyperparameters
Note: the possible hyperparameters must be added to train_model under 'if not evaluateOnly:'. Once the best hyperparameters are determined, add them to train_model under 'if evaluateOnly:'.
Purpose:
Using the .ml file generated by Packet_Features_Generator.py & Features.py, this program trains a RandomForest Classifier on the provided data and reports results using cross-validation. These results include the best scaling and normalization options for each data set as well as the best grid search hyperparameters based on the provided ranges.
Data
Encrypted network traffic was collected on an isolated computer visiting different Wikipedia and New York Times articles, different Google search queries (collected in the form of their autocomplete results and their results page), and different actions taken on a Virtual Reality headset.
Data for this experiment was stored and analyzed in the form of a txt file for each experiment which contains:
The first number is a classification number denoting which website, query, or VR action is taking place.
The remaining numbers in each line denote:
The size of a packet,
and the direction it is traveling.
negative numbers denote incoming packets
positive numbers denote outgoing packets
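The line format described above can be read with a few lines of Python. This is a minimal sketch: parse_capture_line is a hypothetical name, and the delimiter (whitespace and/or commas) is an assumption, since the description does not say how the numbers are separated.

```python
import re
from typing import List, Tuple


def parse_capture_line(line: str) -> Tuple[int, List[int], List[int]]:
    """Split one capture line into (label, incoming sizes, outgoing sizes).

    Assumes values are separated by whitespace and/or commas; the actual
    delimiter in the .txt files is not stated in the description.
    """
    tokens = [t for t in re.split(r"[,\s]+", line.strip()) if t]
    label = int(tokens[0])                     # classification number
    sizes = [int(t) for t in tokens[1:]]       # signed packet sizes
    incoming = [-s for s in sizes if s < 0]    # negative = incoming
    outgoing = [s for s in sizes if s > 0]     # positive = outgoing
    return label, incoming, outgoing


label, incoming, outgoing = parse_capture_line("3 565 -1448 -1448 97 -320")
print(label, incoming, outgoing)  # 3 [1448, 1448, 320] [565, 97]
```

The sample line is made up for illustration and does not come from the dataset.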
Figure 4 Data
This data uses specific lines from the Virtual Reality.txt file.
The action 'LongText Search' refers to a user searching for "Saint Basils Cathedral" with text in the Wander app.
The action 'ShortText Search' refers to a user searching for "Mexico" with text in the Wander app.
The .xlsx and .csv files are identical.
Each file includes (from right to left):
The original packet data,
each line of data organized from smallest to largest packet size in order to calculate the mean and standard deviation of each packet capture,
and the final Cumulative Distribution Function (CDF) calculation that generated the Figure 4 Graph.
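The sort-then-accumulate procedure described above can be sketched as follows. This assumes an empirical CDF in which the i-th smallest packet is assigned (i + 1) / n; the description does not state the exact formula used for Figure 4, nor whether the sample or population standard deviation was used, so both choices here are assumptions, and the packet sizes are made up.

```python
import statistics
from typing import List, Tuple


def empirical_cdf(sizes: List[int]) -> List[Tuple[int, float]]:
    """Pair each sorted packet size with its rank fraction (i + 1) / n."""
    ordered = sorted(sizes)
    n = len(ordered)
    return [(value, (i + 1) / n) for i, value in enumerate(ordered)]


sizes = [400, 100, 300, 200]          # made-up packet sizes
print(statistics.mean(sizes))         # mean packet size
print(statistics.stdev(sizes))        # sample standard deviation (assumed)
print(empirical_cdf(sizes))           # [(100, 0.25), (200, 0.5), (300, 0.75), (400, 1.0)]
```

Plotting the second element of each pair against the first gives a step curve of the kind a CDF figure usually shows.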
New York Times article headlines for NLP, Sentiment Analysis purposes. Downloaded using NYT API.
Field description:
- pub_date - publication date (datetime),
- abstract - snippet of the Article (string),
- headline - Article headline (string),
- lead_paragraph - Article lead paragraph (string),
- news_desk - Desk in the newsroom that worked on the story,
- section_name - Section that the article appeared in,
- word_count - number of words in the article (int).
Source: https://developer.nytimes.com/docs/archive-product/1/types/Article
Terms of Use: https://developer.nytimes.com/terms
If you use this data, you must attribute it to “The New York Times” in any publication.
https://creativecommons.org/publicdomain/zero/1.0/
NYCDOT's Traffic Management Center (TMC) maintains a map of traffic speed detectors throughout the City. The speed detectors themselves belong to various city and state agencies. The Traffic Speeds Map is available on the DOT's website. This data feed contains 'real-time' traffic information from locations where NYCDOT picks up sensor feeds within the five boroughs, mostly on major arterials and highways. NYCDOT uses this information for emergency response and management.
Here's the link to the original dataset.
The Weather Channel averaged 285.6 million visitors to its website in the 12 months running to May 2024, making it the leading news brand worldwide in this respect. The New York Times followed in second place with 113 million web visitors.
In April 2025, the news website with the most monthly visits in the United States was nytimes.com, with a total of ***** million visits that month. In second place was cnn.com with just over *** million visits, followed by foxnews.com with almost a ****** of a million.
Online news consumption in the U.S.
Americans get their news in a variety of ways, but social media is an increasingly popular option. A survey on social media news consumption revealed that ** percent of Twitter users regularly used the site for news, and Facebook and Reddit were also popular for news among their users. Interestingly though, social media is the least trusted news source in the United States.
News and trust
Trust in news sources has become increasingly important to the American news consumer amidst the spread of fake news, and the public are more vocal about whether or not they have faith in a source to report news correctly. Ongoing discussions about the credibility, accuracy and bias of news networks, anchors, TV show hosts, and news media professionals mean that those looking to keep up to date tend to be more cautious than ever before. In general, news audiences are skeptical. In 2020, just **** percent of respondents to a survey investigating the perceived objectivity of the mass media reported having a great deal of trust in the media to report news fully, accurately, and fairly.
The graph shows leading daily newspapers with paywalls in the United States from April to September 2014, by number of paid restricted access website accounts. In that time period, the Los Angeles Times ranked fourth with nearly ** thousand paid restricted access website accounts.
Digital publishing: additional information
The New York Times has been the most successful American daily when it comes to attracting readers willing to pay for its online content. The paywall, which was introduced in March 2011, allows users to read ** articles a month for free. Once that limit has been reached, users are required to pay in order to read more articles. According to the New York Times Company's own data, the number of paid subscribers to its digital-only products amounted to *** thousand in the second quarter of 2015, reflecting steady growth since the paywall was introduced. Circulation revenue now exceeds advertising revenue. The New York Times seems to belong to a successful minority. During a survey in late 2013, more publishers admitted that the introduction of the paywall led to a decrease of traffic on their website. This was the experience of a third of the respondents, whereas a quarter stated the paywall boosted the traffic. Most publishers allow their readers to view **** or *** free articles a month, which makes the aforementioned New York Times model of ** articles more than generous in comparison. In general, the U.S. digital publishing industry is expected to thrive. Between 2014 and 2020, revenues are predicted to nearly double, generating approximately **** billion U.S. dollars. Among the three major types of digital publishing products (e-books, e-magazines and e-newspapers), it is the latter that will develop the most rapidly over the aforementioned period of time.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
One of the most compelling trends in technology today is the open data and open governance movement. It's not without reason that no less than Tim Berners-Lee himself, the creator of the World Wide Web and one of the most preeminent scholars of the Internet, is doing his latest work in getting more government data on the web: in an interview with The New York Times a few years ago he spoke about how even records as mundane as traffic statistics or weather data could drive tinkerers to "make government run better".
New York City has been at the forefront of this movement: Mayor Bloomberg formalized a citywide analytics team as the Mayor's Office for Data Analytics in 2013, and the effort has continued under Mayor de Blasio, with the city cementing its first Open Data Plan in July 2015. The resultant NYC Open Data Portal is populated with over 1500 datasets. It was, and is, the largest citywide open data portal in the world.
Nevertheless, a good open data platform is more than a count; it's a function also of all of the maintenance and structure that goes into it. What's a "dataset", who's publishing them, and how well-maintained are they?
This dataset contains the publicly available metadata about the datasets in the NYC Open Data portal, provided in a JSON format.
For an initial exploration of its contents see this blog post.
NYC Traffic Accidents
Data Source: NYC OpenData | Maven Analytics
License: Public Domain
Introduction
Motor vehicle collisions reported by the New York City Police Department from January-August 2020. Each record represents an individual collision, including the date, time and location of the accident (borough, zip code, street name, latitude/longitude), vehicles and victims involved, and contributing factors.
Recommended Analysis
- Compare the % of total accidents by month. Do you notice any seasonal patterns?
- Break down accident frequency by day of week and hour of day. Based on this data, when do accidents occur most frequently?
- On which particular street were the most accidents reported? What does that represent as a % of all reported accidents?
- What was the most common contributing factor for the accidents reported in this sample (based on Vehicle 1)? What about for fatal accidents specifically?
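As a starting point for the first recommended analysis, the share of collisions per month could be computed roughly as below. This is a sketch under assumptions: share_by_month is a hypothetical helper, the date format is assumed to be YYYY-MM-DD (the dataset's actual format may differ), and the sample dates are invented.

```python
from collections import Counter
from datetime import datetime
from typing import Dict, List


def share_by_month(dates: List[str], fmt: str = "%Y-%m-%d") -> Dict[str, float]:
    """Return each month's percentage of all collisions, keyed by 'YYYY-MM'."""
    months = Counter(datetime.strptime(d, fmt).strftime("%Y-%m") for d in dates)
    total = sum(months.values())
    return {m: 100 * c / total for m, c in sorted(months.items())}


sample = ["2020-01-03", "2020-01-17", "2020-02-09", "2020-03-21"]  # made-up dates
print(share_by_month(sample))  # {'2020-01': 50.0, '2020-02': 25.0, '2020-03': 25.0}
```

The same Counter pattern extends to the other questions by keying on weekday, hour, street name, or contributing factor instead of month.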
https://creativecommons.org/publicdomain/zero/1.0/
Number of NYC taxi passengers, where the five anomalies occur during the NYC marathon, Thanksgiving, Christmas, New Year's Day, and a snowstorm. The raw data is from the NYC Taxi and Limousine Commission. The data file included here was produced by aggregating the total number of taxi passengers into 30-minute buckets.
The data can be used for analysis, anomaly detection or any relevant study.
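The 30-minute aggregation mentioned above can be sketched as follows. The function bucket_30min and the (pickup time, passenger count) input shape are assumptions for illustration; the raw Taxi and Limousine Commission records have their own schema, and the sample trips are invented.

```python
from collections import defaultdict
from datetime import datetime
from typing import Dict, Iterable, Tuple


def bucket_30min(trips: Iterable[Tuple[datetime, int]]) -> Dict[datetime, int]:
    """Sum passenger counts into 30-minute buckets keyed by bucket start."""
    totals: Dict[datetime, int] = defaultdict(int)
    for pickup, passengers in trips:
        # Round the pickup time down to :00 or :30.
        start = pickup.replace(minute=pickup.minute - pickup.minute % 30,
                               second=0, microsecond=0)
        totals[start] += passengers
    return dict(sorted(totals.items()))


trips = [
    (datetime(2014, 11, 2, 10, 5), 2),   # made-up trips
    (datetime(2014, 11, 2, 10, 29), 1),
    (datetime(2014, 11, 2, 10, 31), 3),
]
for start, total in bucket_30min(trips).items():
    print(start.isoformat(), total)
```

With the sample trips, the first two fall into the 10:00 bucket and the third into the 10:30 bucket.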