https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Trump ban on social media following capitol riots made Parler and other fringe platforms gain a favourable right leaning follower base. The free-speech advocates on Parler reportedly perpetuated hatred and inflamed conspiracy theories. Few security researchers on twitter have been paying attention to the so-called "Right wing" network and with great effort, they archived around 3billion of the posts on archive.org over the past few months.
As of 11th Jan 2020, Parler was removed from Google and Apple app stores, and the site was taken down by AWS.
There are several txt files, each containing URL to an individual post. There are image, txt, and links to the video files. It also contains deleted posts and videos. https://web.archive.org/web/20210110202718/https://parler.com/post/d18e8fedcaf147649f160267e57bde41 It's beyond the scope to individually pull all the information for analysis here. It's quite big and slow to do it on one computer :) ~ 100 tb.
Twitter : @donk_enby
Sentiment analysis on the text data. Analysis of hate speech and profiling. Deep moji analysis Ideas on how to moderate a platform like this in future.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset Summary
This is a 10K hours subset of English version of the Multilingual LibriSpeech (MLS) dataset. The data archives were restructured from the original ones from OpenSLR to make it easier to stream. MLS dataset is a large multilingual corpus suitable for speech research. The dataset is derived from read audiobooks from LibriVox and consists of 8 languages - English, German, Dutch, Spanish, French, Italian, Portuguese, Polish. It includes about 44.5K hours of English and… See the full description on the dataset page: https://huggingface.co/datasets/parler-tts/mls_eng_10k.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset Card for English MLS
Dataset Summary
This is a streamable version of the English version of the Multilingual LibriSpeech (MLS) dataset. The data archives were restructured from the original ones from OpenSLR to make it easier to stream. MLS dataset is a large multilingual corpus suitable for speech research. The dataset is derived from read audiobooks from LibriVox and consists of 8 languages - English, German, Dutch, Spanish, French, Italian, Portuguese… See the full description on the dataset page: https://huggingface.co/datasets/parler-tts/mls_eng.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Trump ban on social media following capitol riots made Parler and other fringe platforms gain a favourable right leaning follower base. The free-speech advocates on Parler reportedly perpetuated hatred and inflamed conspiracy theories. Few security researchers on twitter have been paying attention to the so-called "Right wing" network and with great effort, they archived around 3billion of the posts on archive.org over the past few months.
As of 11th Jan 2020, Parler was removed from Google and Apple app stores, and the site was taken down by AWS.
There are several txt files, each containing URL to an individual post. There are image, txt, and links to the video files. It also contains deleted posts and videos. https://web.archive.org/web/20210110202718/https://parler.com/post/d18e8fedcaf147649f160267e57bde41 It's beyond the scope to individually pull all the information for analysis here. It's quite big and slow to do it on one computer :) ~ 100 tb.
Twitter : @donk_enby
Sentiment analysis on the text data. Analysis of hate speech and profiling. Deep moji analysis Ideas on how to moderate a platform like this in future.