3 datasets found
  1. ICIJ OffshoreLeaks

    • opensanctions.org
    Updated Jul 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    International Consortium of Investigative Journalists (2025). ICIJ OffshoreLeaks [Dataset]. https://www.opensanctions.org/datasets/icij_offshoreleaks/
    Explore at:
    application/json+ftmAvailable download formats
    Dataset updated
    Jul 13, 2025
    Dataset authored and provided by
    International Consortium of Investigative Journalistshttps://www.icij.org/
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Combined data from the Panama Papers, Paradise Papers, Pandora Papers and other cross-border investigations conducted by ICIJ and its partners.

  2. Paradise-Panama-Papers

    • kaggle.com
    Updated Nov 21, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zeeshan-ul-hassan Usmani (2017). Paradise-Panama-Papers [Dataset]. http://doi.org/10.34740/kaggle/dsv/7596
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 21, 2017
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Zeeshan-ul-hassan Usmani
    License

    Attribution-ShareAlike 3.0 (CC BY-SA 3.0)https://creativecommons.org/licenses/by-sa/3.0/
    License information was derived automatically

    Description

    Context

    The Paradise Papers is a cache of some 13GB of data that contains 13.4 million confidential records of offshore investment by 120,000 people and companies in 19 tax jurisdictions (Tax Heavens - an awesome video to understand this); that was published by the International Consortium of Investigative Journalists (ICIJ) on November 5, 2017. Here is a brief video about the leak. The people include Queen Elizabeth II, the President of Columbia (Juan Manuel Santos), Former Prime Minister of Pakistan (Shaukat Aziz), U.S Secretary of Commerce (Wilbur Ross) and many more. According to an estimate by the Boston Consulting Group, the amount of money involved is around $10 trillion. The leak contains many famous companies, including Facebook, Apple, Uber, Nike, Walmart, Allianz, Siemens, McDonald’s and Yahoo.

    It also contains a lot of U. S President Donald Trump allies including Rax Tillerson, Wilbur Ross, Koch Brothers, Paul Singer, Sheldon Adelson, Stephen Schwarzman, Thomas Barrack and Steve Wynn etc. The complete list of Politicians involve is avaiable here.

    The Panama Papers in the cache of 38GB of data from the national corporate registry of Bahamas. It contains world’s top politicians and influential persons as head and director of offshore companies registered in Bahamas.

    Offshore Leaks details 13,000 offshore accounts in a report.

    I am calling all data scientists to help me stop the corruption and reveal the patterns and linkages invisible for the untrained eye.

    Content

    The data is the effort of more than 100 journalists from 60+ countries

    The original data is available under creative common license and can be downloaded from this link.

    I will keep updating the datasets with more leaks and data as it’s available

    Acknowledgements

    International Consortium of Investigative Journalists (ICIJ)

    Paradise Papers Update

    Paradise Papers data has been uploaded as released by ICIJ on Nov 21, 2017. You can find Paradise Papers zip file and six extracted files in CSV format, all starting with a prefix of Paradise. Happy Coding!

    Inspiration

    Some ideas worth exploring:

    1. How many companies and individuals are there in all of the leaks data

    2. How many countries involved

    3. Total money involved

    4. What is the biggest best tax heaven

    5. Can we compare the corruption with human development index and make an argument that would correlate corruption with bad conditions in that country

    6. Who are the biggest cheaters and where they live

    7. What role Fortune 500 companies play in this game

    I need your help to make this world corruption free in the age of NLP and Big Data

  3. The results of direct mappings of panama paper to RDF and RDF-star

    • zenodo.org
    bin
    Updated May 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    shahrzad khayatbashi; shahrzad khayatbashi; Sebastián Ferrada; Sebastián Ferrada; Olaf Hartig; Olaf Hartig (2022). The results of direct mappings of panama paper to RDF and RDF-star [Dataset]. http://doi.org/10.5281/zenodo.6524085
    Explore at:
    binAvailable download formats
    Dataset updated
    May 7, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    shahrzad khayatbashi; shahrzad khayatbashi; Sebastián Ferrada; Sebastián Ferrada; Olaf Hartig; Olaf Hartig
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    In the paper 'Converting Property Graphs to RDF: A Preliminary Study of the Practical Impact of Different Mappings', we have used mappings introduced by Nguyen et al. [1], and Hartig [2]. Data sets are the results of direct mappings of the real-world LPG with data about the Panama Papers[3].

    Shahrzad Khayatbashi, Sebastián Ferrada, and Olaf Hartig. 2022. Converting Property Graphs to RDF: A Preliminary Study of the Practical Impact of Different Mappings. In Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA) (GRADES & NDA’22), June 12, 2022, Philadelphia, PA, USA. ACM, New York, NY, USA, 8 pages. https://doi.org/10.1145/3534540.3534695

    References:

    [1] Vinh Nguyen, Hong Yung Yip, Harsh Thakkar, Qingliang Li, Evan Bolton, and Olivier Bodenreider. 2019. Singleton Property Graph: Adding A Semantic Web Abstraction Layer to Graph Databases. In Proceedings of the Blockchain enabled Semantic Web Workshop (BlockSW) and Contextualized Knowledge Graphs (CKG) Workshop.

    [2] Olaf Hartig. 2017. Foundations of RDF* and SPARQL*: An Alternative Approach to Statement-Level Metadata in RDF. In Proceedings of the 11th Alberto Mendelzon International Workshop on Foundations of Data Management and the Web (AMW).

    [3] https://offshoreleaks.icij.org/pages/database

  4. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
International Consortium of Investigative Journalists (2025). ICIJ OffshoreLeaks [Dataset]. https://www.opensanctions.org/datasets/icij_offshoreleaks/
Organization logo

ICIJ OffshoreLeaks

Explore at:
6 scholarly articles cite this dataset (View in Google Scholar)
application/json+ftmAvailable download formats
Dataset updated
Jul 13, 2025
Dataset authored and provided by
International Consortium of Investigative Journalistshttps://www.icij.org/
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Combined data from the Panama Papers, Paradise Papers, Pandora Papers and other cross-border investigations conducted by ICIJ and its partners.

Search
Clear search
Close search
Google apps
Main menu