Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Combined data from the Panama Papers, Paradise Papers, Pandora Papers and other cross-border investigations conducted by ICIJ and its partners.
Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
License information was derived automatically
The Paradise Papers is a cache of some 13GB of data that contains 13.4 million confidential records of offshore investment by 120,000 people and companies in 19 tax jurisdictions (Tax Heavens - an awesome video to understand this); that was published by the International Consortium of Investigative Journalists (ICIJ) on November 5, 2017. Here is a brief video about the leak. The people include Queen Elizabeth II, the President of Columbia (Juan Manuel Santos), Former Prime Minister of Pakistan (Shaukat Aziz), U.S Secretary of Commerce (Wilbur Ross) and many more. According to an estimate by the Boston Consulting Group, the amount of money involved is around $10 trillion. The leak contains many famous companies, including Facebook, Apple, Uber, Nike, Walmart, Allianz, Siemens, McDonald’s and Yahoo.
It also contains a lot of U. S President Donald Trump allies including Rax Tillerson, Wilbur Ross, Koch Brothers, Paul Singer, Sheldon Adelson, Stephen Schwarzman, Thomas Barrack and Steve Wynn etc. The complete list of Politicians involve is avaiable here.
The Panama Papers in the cache of 38GB of data from the national corporate registry of Bahamas. It contains world’s top politicians and influential persons as head and director of offshore companies registered in Bahamas.
Offshore Leaks details 13,000 offshore accounts in a report.
I am calling all data scientists to help me stop the corruption and reveal the patterns and linkages invisible for the untrained eye.
The data is the effort of more than 100 journalists from 60+ countries
The original data is available under creative common license and can be downloaded from this link.
I will keep updating the datasets with more leaks and data as it’s available
International Consortium of Investigative Journalists (ICIJ)
Paradise Papers data has been uploaded as released by ICIJ on Nov 21, 2017. You can find Paradise Papers zip file and six extracted files in CSV format, all starting with a prefix of Paradise. Happy Coding!
Some ideas worth exploring:
How many companies and individuals are there in all of the leaks data
How many countries involved
Total money involved
What is the biggest best tax heaven
Can we compare the corruption with human development index and make an argument that would correlate corruption with bad conditions in that country
Who are the biggest cheaters and where they live
What role Fortune 500 companies play in this game
I need your help to make this world corruption free in the age of NLP and Big Data
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
In the paper 'Converting Property Graphs to RDF: A Preliminary Study of the Practical Impact of Different Mappings', we have used mappings introduced by Nguyen et al. [1], and Hartig [2]. Data sets are the results of direct mappings of the real-world LPG with data about the Panama Papers[3].
Shahrzad Khayatbashi, Sebastián Ferrada, and Olaf Hartig. 2022. Converting Property Graphs to RDF: A Preliminary Study of the Practical Impact of Different Mappings. In Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA) (GRADES & NDA’22), June 12, 2022, Philadelphia, PA, USA. ACM, New York, NY, USA, 8 pages. https://doi.org/10.1145/3534540.3534695
References:
[1] Vinh Nguyen, Hong Yung Yip, Harsh Thakkar, Qingliang Li, Evan Bolton, and Olivier Bodenreider. 2019. Singleton Property Graph: Adding A Semantic Web Abstraction Layer to Graph Databases. In Proceedings of the Blockchain enabled Semantic Web Workshop (BlockSW) and Contextualized Knowledge Graphs (CKG) Workshop.
[2] Olaf Hartig. 2017. Foundations of RDF* and SPARQL*: An Alternative Approach to Statement-Level Metadata in RDF. In Proceedings of the 11th Alberto Mendelzon International Workshop on Foundations of Data Management and the Web (AMW).
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Combined data from the Panama Papers, Paradise Papers, Pandora Papers and other cross-border investigations conducted by ICIJ and its partners.