31 datasets found
  1. S

    AMiner

    • scidb.cn
    Updated Sep 29, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Huaiyu Wan; Yutao Zhang; Jing Zhang; Jie Tang (2020). AMiner [Dataset]. http://doi.org/10.11922/sciencedb.j00104.00004
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 29, 2020
    Dataset provided by
    Science Data Bank
    Authors
    Huaiyu Wan; Yutao Zhang; Jing Zhang; Jie Tang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    AMiner (aminer.org) aims to provide comprehensive search and mining services for researcher social networks. The system focuses on: (1) creating a semantic-based profile for each researcher by extracting information from the distributed Web; (2) integrating academic data (e.g., the bibliographic data and the researcher profiles) from multiple sources; (3) accurately searching the heterogeneous network; (4) analyzing and discovering interesting patterns from the built researcher social network. The main search and analysis functions in AMiner include: profile search, expert finding, conference analysis, course search, sub-graph search, topic browser, academic ranks, and user management.

  2. AMiner Academic Citation Dataset

    • kaggle.com
    zip
    Updated Dec 12, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    K Scott Mader (2018). AMiner Academic Citation Dataset [Dataset]. https://www.kaggle.com/kmader/aminer-academic-citation-dataset
    Explore at:
    zip(2456396542 bytes)Available download formats
    Dataset updated
    Dec 12, 2018
    Authors
    K Scott Mader
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    License

    • Attribution - You must give appropriate credit, provide a link to the our website(url: http://doc.aminer.org), and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
    • NonCommercial - You may not use the material for commercial purposes.
    • NoDerivatives - If you remix, transform, or build upon the material,you may not distribute the modified material.
    • No additional restrictions - You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.

    Acknowledgements

    The data was collected and prepared for Non-commercial research use by Aminer (https://aminer.org). These serve as small downloads of a some datasets for exploration

    Inspiration

    • Come up with better academic rankings
    • Determine the groups and cliques within the academic network
  3. h

    aminer-citation-graphv14-jaccard

    • huggingface.co
    Updated Feb 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pascal J Passigan (2024). aminer-citation-graphv14-jaccard [Dataset]. https://huggingface.co/datasets/ppxscal/aminer-citation-graphv14-jaccard
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 24, 2024
    Authors
    Pascal J Passigan
    Description

    Dataset Card for Dataset Name

    This dataset card aims to be a base template for new datasets. It has been generated using this raw template.

      Dataset Details
    
    
    
    
    
      Dataset Description
    

    Contains text pairs from https://www.aminer.org/citation v14. Similairty socres calculated with Jaccard index.

    Curated by: [More Information Needed] Funded by [optional]: [More Information Needed] Shared by [optional]: [More Information Needed] Language(s) (NLP): [More Information… See the full description on the dataset page: https://huggingface.co/datasets/ppxscal/aminer-citation-graphv14-jaccard.

  4. Z

    AMiner-534K - Dataset

    • data.niaid.nih.gov
    • resodate.org
    • +1more
    Updated Oct 14, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Santini, Cristian (2021). AMiner-534K - Dataset [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_5565219
    Explore at:
    Dataset updated
    Oct 14, 2021
    Dataset provided by
    University of Bologna
    Authors
    Santini, Cristian
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is a knowledge graph extracted from a AMiner benchmark for a research project on knowledge graph embeddings (KGEs) for author disambiguation. Structural triples of the knowledge graph are split into training, testing and validation for applying representation learning methods. Textual literals and numeric literals were stored separately in order to implement multimodal approaches for KGEs (see arXiv:1802.00934). For the same reason, textual literals and numeric literals are already stored into sentence embeddings and a numeric matrix respectively in the files textual_literals.npy and numeric_literals.npy. For the script used to gather this dataset see the GitHub repository: https://github.com/sntcristian/and-kge/tree/main/aminer.

  5. s

    AMiner

    • marketplace.sshopencloud.eu
    • opendatalab.com
    Updated Apr 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). AMiner [Dataset]. https://marketplace.sshopencloud.eu/dataset/F942KH
    Explore at:
    Dataset updated
    Apr 24, 2020
    Description

    AMiner (aminer.org) aims to provide comprehensive search and mining services for researcher social networks. In this system, we focus on: (1) creating a semantic-based profile for each researcher by extracting information from the distributed Web; (2) integrating academic data (e.g., the bibliographic data and the researcher profiles) from multiple sources; (3) accurately searching the heterogeneous network; (4) analyzing and discovering interesting patterns from the built researcher social network.

  6. S

    Aminer-na dataset

    • scidb.cn
    Updated Sep 26, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cui huanqing (2022). Aminer-na dataset [Dataset]. http://doi.org/10.57760/sciencedb.j00133.00124
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 26, 2022
    Dataset provided by
    Science Data Bank
    Authors
    Cui huanqing
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The dataset for name disambiguation is composed of 100 author names extracted from aminer database, including 12789 authors and 70258 documents.

  7. n

    aminer

    • networkrepository.com
    csv
    Updated Dec 10, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Network Data Repository (2019). aminer [Dataset]. https://networkrepository.com/ca-aminer.php
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 10, 2019
    Dataset authored and provided by
    Network Data Repository
    License

    https://networkrepository.com/policy.phphttps://networkrepository.com/policy.php

    Description

    Collaboration Networks

  8. S

    Data from: AMiner: Search and Mining of Academic Social Networks

    • scidb.cn
    Updated Oct 15, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Huaiyu Wan; Yutao Zhang; Jing Zhang; Jie Tang (2020). AMiner: Search and Mining of Academic Social Networks [Dataset]. http://doi.org/10.11922/sciencedb.j00104.00021
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 15, 2020
    Dataset provided by
    Science Data Bank
    Authors
    Huaiyu Wan; Yutao Zhang; Jing Zhang; Jie Tang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    8 figures of the paper. Figure 1 presents the architecture of AMiner. Figure 2 shows the schema of the researcher profile. Figure 3 is an example of researcher profile. Figure 4 is an overview of the name disambiguation framework in AMiner. Figure 5 is graphical representation of the three Author-Conference-Topic (ACT) models. Figure 6 shows an example result of experts found for “Data Mining”. Figure 7 is a model framework of DeepInf. Figure 8 shows an example of researcher ranking by sociability index.

  9. Researcher Profile Extraction Dataset

    • figshare.com
    application/x-rar
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jie Tang (2023). Researcher Profile Extraction Dataset [Dataset]. http://doi.org/10.6084/m9.figshare.6050570.v1
    Explore at:
    application/x-rarAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Jie Tang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A copy of the dataset used for researcher profile extraction by Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su.This dataset was downloaded from https://aminer.org/lab-datasets/profiling/

  10. DBLP-Citation-network V13

    • kaggle.com
    zip
    Updated Nov 9, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nikita Mineev (2022). DBLP-Citation-network V13 [Dataset]. https://www.kaggle.com/datasets/nikitamineev/dblpcitationnetwork-v13
    Explore at:
    zip(3741057992 bytes)Available download formats
    Dataset updated
    Nov 9, 2022
    Authors
    Nikita Mineev
    Description

    Taken from here https://www.aminer.org/citation and converted to csv (but why)

  11. OAGT Paper Topic Dataset

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated May 24, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erion Çano; Erion Çano (2022). OAGT Paper Topic Dataset [Dataset]. http://doi.org/10.5281/zenodo.6560535
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 24, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Erion Çano; Erion Çano
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    OAGT is a paper topic dataset consisting of 6942930 records which comprise various scientific publication attributes like abstracts, titles, keywords, publication years, venues, etc. The last two fields of each record are the topic id from a taxonomy of 27 topics created from the entire collection and the 20 most significant topic words. Each dataset record (sample) is stored as a JSON line in the text file.

    The data is derived from OAG data collection (https://aminer.org/open-academic-graph) which was released
    under ODC-BY license.

    This data (OAGT Paper Topic Dataset) is released under CC-BY license (https://creativecommons.org/licenses/by/4.0/).

    If using it, please cite the following paper:

    Erion Çano, Benjamin Roth: Topic Segmentation of Research Article Collections. ArXiv 2022, CoRR abs/2205.11249, https://doi.org/10.48550/arXiv.2205.11249

  12. PERSON Dataset V2

    • figshare.com
    zip
    Updated Nov 1, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shayan A. Tabrizi; Azadeh Shakery; Mohammad Ali Tavallaei; Masoud Asadpour (2018). PERSON Dataset V2 [Dataset]. http://doi.org/10.6084/m9.figshare.6958514.v1
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 1, 2018
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Shayan A. Tabrizi; Azadeh Shakery; Mohammad Ali Tavallaei; Masoud Asadpour
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PERSON Dataset V2:Dataset created for paper "Search Personalization Based on Social-Network-Based Interestedness Measures." Please cite the paper for any usage.The dataset is produced by data cleaning of AMiner's citation network V2 dataset (https://aminer.org/citation). Anyone who wants to use PERSON V2 dataset must cite Aminer's dataset (as explained in its homepage: Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. ArnetMiner: Extraction and Mining of Academic Social Networks. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD'2008). pp.990-998) as well as the aforementioned paper.It includes two files: 1- authors_giant.txt: the information of authors and their co-authors. The format is as follows: author ID author name
    the list of coauthors delimited by "," (Each entry contains the ID of the coauthor followed by the number of times they co-authored a paper) ... 2- papers_giant.txt: the information of papers and references. The format is as follows: paper ID Is paper merged (See the first paper for details) original paper ID (in Aminer's dataset) blank blank blank blank title abstract time (only the year part is important) blank references to papers out of the PERSON dataset (indicated by Aminer's IDs) references to papers inside the PERSON dataset (indicated by PERSON's IDs) author IDs ...

  13. r

    AMiner-534K: Knowledge Graph of AMiner benchmark for Author Name...

    • resodate.org
    • nde-dev.biothings.io
    • +1more
    Updated Nov 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Cristian Santini; Mehwish Alam; Gesese. Genet Asefa; Silvio Peroni; Aldo Gangemi; Harald Sack (2021). AMiner-534K: Knowledge Graph of AMiner benchmark for Author Name Disambiguation [Dataset]. https://resodate.org/resources/aHR0cHM6Ly96ZW5vZG8ub3JnL3JlY29yZHMvNTY3NTgwMQ==
    Explore at:
    Dataset updated
    Nov 12, 2021
    Dataset provided by
    Zenodo
    Authors
    Cristian Santini; Mehwish Alam; Gesese. Genet Asefa; Silvio Peroni; Aldo Gangemi; Harald Sack
    Description

    This dataset is a knowledge graph extracted from aAMiner benchmarkfor a research project on knowledge graph embeddings (KGEs)for author disambiguation. Structural triples of the knowledge graph are split into training, testing and validation for applying representation learning methods. Textual literals and numeric literals were stored separately in order to implement multimodal approaches for KGEs (seearXiv:1802.00934). For the same reason, textual literals and numeric literals are already stored into sentence embeddings and anumeric matrixrespectively in the filestextual_literals.npyandnumeric_literals.npy. The fileand_eval.jsoncontains the evaluation dataset used for evaluating our AND architecture. For the script used to gather this dataset see the GitHub repository:https://github.com/sntcristian/and-kge/tree/main/aminer.

  14. Citation Networks

    • kaggle.com
    zip
    Updated Jul 20, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sharukh Rahman (2022). Citation Networks [Dataset]. https://www.kaggle.com/datasets/devintheai/citation-networks/versions/1
    Explore at:
    zip(514887849 bytes)Available download formats
    Dataset updated
    Jul 20, 2022
    Authors
    Sharukh Rahman
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Original Dataset: aminer.org and kaggle:Citation Network Dataset

    The citation data is extracted from DBLP, ACM, MAG (Microsoft Academic Graph), and other sources. The first version contains 629,814 papers and 632,752 citations.

    DBLP-Citation-network V12: 4,894,081 papers and 45,564,149 citation relationships (2020-04-09)

  15. text features aminer

    • kaggle.com
    zip
    Updated Nov 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Салятов Юрий Леонидович (2025). text features aminer [Dataset]. https://www.kaggle.com/datasets/yurysalyatov/text-features-aminer
    Explore at:
    zip(12763512 bytes)Available download formats
    Dataset updated
    Nov 5, 2025
    Authors
    Салятов Юрий Леонидович
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Салятов Юрий Леонидович

    Released under Apache 2.0

    Contents

  16. E

    OAGSX Title Generation Dataset

    • live.european-language-grid.eu
    binary format
    Updated Oct 31, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). OAGSX Title Generation Dataset [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/1286
    Explore at:
    binary formatAvailable download formats
    Dataset updated
    Oct 31, 2019
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    OAGSX is a title generation dataset consisting of 34408509 abstracts and titles from scientific articles. The texts were lowercased and tokenized with Stanford CoreNLP tokenizer. No other preprocessing steps were applied in this release version. Dataset records (samples) are stored as JSON lines in each text file.

    The data is derived from OAG data collection (https://aminer.org/open-academic-graph) which was released under ODC-BY license.

    This data (OAGSX Title Generation Dataset) is released under CC-BY license (https://creativecommons.org/licenses/by/4.0/).

    If using it, please consider citing also the following paper:

    Çano Erion, Bojar Ondřej. Two Huge Title and Keyword Generation Corpora of Research Articles.

    LREC 2020, Proceedings of the the 12th International Conference on Language Resources and Evaluation,

    Marseille, France, May 2020.

  17. g

    Computer Science (1970-2014)

    • search.gesis.org
    • datacatalogue.cessda.eu
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lietz, Haiko, Computer Science (1970-2014) [Dataset]. https://search.gesis.org/research_data/SDN-10.7802-2642
    Explore at:
    Dataset provided by
    GESIS, Köln
    GESIS search
    Authors
    Lietz, Haiko
    License

    https://www.gesis.org/en/institute/data-usage-termshttps://www.gesis.org/en/institute/data-usage-terms

    Description

    DBLP (https://dblp.org/) is a comprehensive collection of computer science publications from major and minor journals and conference proceedings. From this dump, we remove arXiv preprints. Our dataset consists of 1.9 million publications from 1970 to 2014 that are authored by 1.1 million authors. We have added citations among publications by combining DBLP with the AMiner dataset (https://www.aminer.org/citation) via publication titles and years. There are 6.6 million citations among publications. Author names in DBLP are disambiguated. To infer the gender of authors, we have used a method that combines the results of name-based and image-based gender detection services. Since the accuracy is very low for Chinese and Korean names, we label their gender as unknown to reduce noise in our analysis.

  18. DBLP Article Similarities (DBLP-ArtSim) dataset

    • zenodo.org
    csv
    Updated Feb 27, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Serafeim Chatzopoulos; Serafeim Chatzopoulos; Thanasis Vergoulis; Thanasis Vergoulis; Ilias Kanellos; Ilias Kanellos; Theodore Dalamagas; Christos Tryfonopoulos; Theodore Dalamagas; Christos Tryfonopoulos (2021). DBLP Article Similarities (DBLP-ArtSim) dataset [Dataset]. http://doi.org/10.5281/zenodo.3778916
    Explore at:
    csvAvailable download formats
    Dataset updated
    Feb 27, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Serafeim Chatzopoulos; Serafeim Chatzopoulos; Thanasis Vergoulis; Thanasis Vergoulis; Ilias Kanellos; Ilias Kanellos; Theodore Dalamagas; Christos Tryfonopoulos; Theodore Dalamagas; Christos Tryfonopoulos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains similarity scores among articles in AMiner's DBLP v10 dataset.

    Similarities are calculated using the JoinSim [1] similarity measure on the derived citation network using the following metapaths:

    • Paper - Author - Paper (PAP_similarities.csv)
    • Paper - Topic - Paper (PTP_similarities.csv)

    The file ids.csv contains a mapping from AMiner's ids to our internal numeric ids used in the similarities files.

    [1] Xiong, Y., Zhu, Y., Yu, P.S.: Top-k similarity join in heterogeneous information networks. IEEE Transactions on Knowledge and Data Engineering 27(6), 1710– 1723 (2015)

  19. E

    OAGK Keyword Generation Dataset

    • live.european-language-grid.eu
    binary format
    Updated Mar 31, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). OAGK Keyword Generation Dataset [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/1263
    Explore at:
    binary formatAvailable download formats
    Dataset updated
    Mar 31, 2019
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    OAGK is a keyword extraction/generation dataset consisting of 2.2 million abstracts, titles and keyword strings from cientific articles. Texts were lowercased and tokenized with Stanford CoreNLP tokenizer. No other preprocessing steps were applied in this release version. Dataset records (samples) are stored as JSON lines in each text file.

    This data is derived from OAG data collection (https://aminer.org/open-academic-graph) which was released under ODC-BY licence.

    This data (OAGK Keyword Generation Dataset) is released under CC-BY licence (https://creativecommons.org/licenses/by/4.0/).

    If using it, please cite the following paper:

    Çano, Erion and Bojar, Ondřej, 2019, Keyphrase Generation: A Text Summarization Struggle, 2019 Annual Conference of the North American Chapter of the Association for Computational Linguistics, June 2019, Minneapolis, USA

  20. Aminer Citation Network Dataset V11

    • kaggle.com
    zip
    Updated May 10, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abdullah D (2021). Aminer Citation Network Dataset V11 [Dataset]. https://www.kaggle.com/abdullahdekebobeketa/aminer-citation-ntk-dataset-v11
    Explore at:
    zip(4029255777 bytes)Available download formats
    Dataset updated
    May 10, 2021
    Authors
    Abdullah D
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Abdullah D

    Released under CC BY-NC-SA 4.0

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Huaiyu Wan; Yutao Zhang; Jing Zhang; Jie Tang (2020). AMiner [Dataset]. http://doi.org/10.11922/sciencedb.j00104.00004

AMiner

Explore at:
378 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 29, 2020
Dataset provided by
Science Data Bank
Authors
Huaiyu Wan; Yutao Zhang; Jing Zhang; Jie Tang
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

AMiner (aminer.org) aims to provide comprehensive search and mining services for researcher social networks. The system focuses on: (1) creating a semantic-based profile for each researcher by extracting information from the distributed Web; (2) integrating academic data (e.g., the bibliographic data and the researcher profiles) from multiple sources; (3) accurately searching the heterogeneous network; (4) analyzing and discovering interesting patterns from the built researcher social network. The main search and analysis functions in AMiner include: profile search, expert finding, conference analysis, course search, sub-graph search, topic browser, academic ranks, and user management.

Search
Clear search
Close search
Google apps
Main menu