5 datasets found
  1. f

    Data from: PPIP: Automated Software for Identification of Bioactive...

    • acs.figshare.com
    • figshare.com
    xlsx
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mingqiang Rong; Baojin Zhou; Ruo Zhou; Qiong Liao; Yong Zeng; Shaohang Xu; Zhonghua Liu (2023). PPIP: Automated Software for Identification of Bioactive Endogenous Peptides [Dataset]. http://doi.org/10.1021/acs.jproteome.8b00718.s003
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    ACS Publications
    Authors
    Mingqiang Rong; Baojin Zhou; Ruo Zhou; Qiong Liao; Yong Zeng; Shaohang Xu; Zhonghua Liu
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Endogenous peptides play an important role in multiple biological processes in many species. Liquid chromatography coupled to tandem mass spectrometry (LC–MS/MS) is an important technique for detecting these peptides on a large scale. We present PPIP, which is a dedicated peptidogenomics software for identifying endogenous peptides based on peptidomics and RNA-Seq data. This software automates the de novo transcript assembly based on RNA-Seq data, construction of a protein reference database based on the de novo assembled transcripts, peptide identification, function analysis, and HTML-based report generation. Different function components are integrated using Docker technology. The Docker image of PPIP is available at https://hub.docker.com/r/shawndp/ppip, and the source code under GPL-3 license is available at https://github.com/Shawn-Xu/PPIP. A user manual of PPIP is available at https://shawn-xu.github.io/PPIP.

  2. f

    Data from: SynthI: A New Open-Source Tool for Synthon-Based Library Design

    • acs.figshare.com
    • datasetcatalog.nlm.nih.gov
    zip
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yuliana Zabolotna; Dmitriy M. Volochnyuk; Sergey V. Ryabukhin; Kostiantyn Gavrylenko; Dragos Horvath; Olga Klimchuk; Oleksandr Oksiuta; Gilles Marcou; Alexandre Varnek (2023). SynthI: A New Open-Source Tool for Synthon-Based Library Design [Dataset]. http://doi.org/10.1021/acs.jcim.1c00754.s002
    Explore at:
    zipAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    ACS Publications
    Authors
    Yuliana Zabolotna; Dmitriy M. Volochnyuk; Sergey V. Ryabukhin; Kostiantyn Gavrylenko; Dragos Horvath; Olga Klimchuk; Oleksandr Oksiuta; Gilles Marcou; Alexandre Varnek
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    Most of the existing computational tools for de novo library design are focused on the generation, rational selection, and combination of promising structural motifs to form members of the new library. However, the absence of a direct link between the chemical space of the retrosynthetically generated fragments and the pool of available reagents makes such approaches appear as rather theoretical and reality-disconnected. In this context, here we present Synthons Interpreter (SynthI), a new open-source toolkit for de novo library design that allows merging those two chemical spaces into a single synthons space. Here synthons are defined as actual fragments with valid valences and special labels, specifying the position and the nature of reactive centers. They can be issued from either the “breakup” of reference compounds according to 38 retrosynthetic rules or real reagents, after leaving group withdrawal or transformation. Such an approach not only enables the design of synthetically accessible libraries and analog generation but also facilitates reagents (building blocks) analysis in the medicinal chemistry context. SynthI code is publicly available at https://github.com/Laboratoire-de-Chemoinformatique/SynthI.

  3. f

    Data from: CAT: A Compound Attachment Tool for the Construction of Composite...

    • acs.figshare.com
    • figshare.com
    txt
    Updated Jun 5, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bas van Beek; Juliette Zito; Lucas Visscher; Ivan Infante (2023). CAT: A Compound Attachment Tool for the Construction of Composite Chemical Compounds [Dataset]. http://doi.org/10.1021/acs.jcim.2c00690.s002
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 5, 2023
    Dataset provided by
    ACS Publications
    Authors
    Bas van Beek; Juliette Zito; Lucas Visscher; Ivan Infante
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The continuous improvement of computer architectures allows for the simulation of molecular systems of growing sizes. However, such calculations still require the input of initial structures, which are also becoming increasingly complex. In this work, we present CAT, a Compound Attachment Tool (source code available at https://github.com/nlesc-nano/CAT) and Python package for the automatic construction of composite chemical compounds, which supports the functionalization of organic, inorganic, and hybrid organic–inorganic materials. The CAT workflow consists in defining the anchoring sites on the reference material, usually a large molecular system denoted as a scaffold, and on the molecular species that are attached to it, i.e., the ligands. Usually, ligands are pre-optimized in a conformation biased toward more linear structures to minimize interligand(s) steric interactions, a bias that is important when multiple ligands are attached onto the scaffold. The resulting superstructure(s) are then stored in various formats that can be used afterward in quantum chemical calculations or classical force field-based simulations.

  4. Datasets supporting analytical workflow of: Chronic Acid Suppression and...

    • figshare.com
    txt
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bing Zhang; Anna Silverman; Saroja Bangaru; Douglas Arneson; Sonya Dasharathy; Nghia Nguyen; Diane Rodden; Jonathan Shih; Atul Butte; Wael El-Nachef; Brigid Boland; Vivek Rudrapatna (2023). Datasets supporting analytical workflow of: Chronic Acid Suppression and Social Determinants of COVID-19 Infection [Dataset]. http://doi.org/10.6084/m9.figshare.13380356.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Bing Zhang; Anna Silverman; Saroja Bangaru; Douglas Arneson; Sonya Dasharathy; Nghia Nguyen; Diane Rodden; Jonathan Shih; Atul Butte; Wael El-Nachef; Brigid Boland; Vivek Rudrapatna
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Publicly available geocoded social determinants of health and mobility datasets used in the analysis of "Chronic Acid Suppression and Social Determinants of COVID-19 Infection".These datasets are required for the analytical workflow shared on Github which demonstrates how the analysis in the manuscript was done using randomly generated samples to protect patient privacy.zcta_county_rel_10.txt - Population and housing density from the 2010 decennial census. Obtained from: https://www2.census.gov/geo/docs/maps-data/data/rel/zcta_county_rel_10.txtcre-2018-a11.csv - Community Resilience Estimates which is is the capacity of individuals and households to absorb, endure, and recover from the health, social, and economic impacts of a disaster such as a hurricane or pandemic. Data obtained from: https://www.census.gov/data/experimental-data-products/community-resilience-estimates.htmlzcta_tract_rel_10.txt - Relationship between ZCTA and US Census tracts (used to map census tracts to ZCTA). Data obtained from: https://www.census.gov/geographies/reference-files/time-series/geo/relationship-files.html#par_textimage_674173622mask-use-by-county.txt - Mask Use By County comes from a large number of interviews conducted online by the global data and survey firm Dynata at the request of The New York Times. The firm asked a question about mask use to obtain 250,000 survey responses between July 2 and July 14, enough data to provide estimates more detailed than the state level. Data obtained from: https://github.com/nytimes/covid-19-data/tree/master/mask-usemobility_report_US.txt - Google mobility report which charts movement trends over time by geography, across different categories of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential. Data obtained from: https://github.com/ActiveConclusion/COVID19_mobility/blob/master/google_reports/mobility_report_US.csvACS2015_zctaallvars.csv - Social Deprivation Index is a composite measure of area level deprivation based on seven demographic characteristics collected in the American Community Survey (https://www.census.gov/programs-surveys/acs/) and used to quantify the socio-economic variation in health outcomes. Factors are: Income, Education, Employment, Housing, Household Characteristics, Transportation, Demographics. Data obtained from: https://www.graham-center.org/rgc/maps-data-tools/sdi/social-deprivation-index.html

  5. f

    Data from: MetScribeR: A Semiautomated Tool for Data Processing of In-House...

    • figshare.com
    csv
    Updated Nov 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Adam M. Tisch; Jason M. Inman; Ewy A. Mathé; Djawed Bennouna (2025). MetScribeR: A Semiautomated Tool for Data Processing of In-House LC-MS Metabolite Reference Libraries [Dataset]. http://doi.org/10.1021/acs.jproteome.5c00548.s002
    Explore at:
    csvAvailable download formats
    Dataset updated
    Nov 17, 2025
    Dataset provided by
    ACS Publications
    Authors
    Adam M. Tisch; Jason M. Inman; Ewy A. Mathé; Djawed Bennouna
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    One persistent challenge in untargeted metabolomics is the identification of compounds from their mass spectrometry (MS) signal, which is necessary for biological data interpretation. This process can be facilitated by building in-house libraries of metabolite standards containing retention time (RT) information, which is orthogonal and complementary to large, published MS/MS spectra repositories. Creating such libraries can require substantial effort and is time intensive. To streamline this process, we developed metScribeR, an R package with a Shiny application to accelerate the creation of RT and m/z libraries. metScribeR provides an easy, user-friendly interface for peak finding, filtering, and comprehensive quality review of the MS data. Uniquely, metScribeR does not require MS/MS spectral information and reports an identification probability estimate for each adduct. In our benchmarking, metScribeR required approximately 10 s of computational and manual effort per standard, showed a correlation of 0.99 between manual and metScribeR-derived RTs, and appropriately filtered out poor quality peaks. The metScribeR output is a.csv file including the identity, m/z, RT, and peak quality information for standards along with MS/MS spectra retrieved from MassBank of North America (MoNA). metScribeR is open source and available for download on GitHub at https://github.com/ncats/metScribeR

  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Mingqiang Rong; Baojin Zhou; Ruo Zhou; Qiong Liao; Yong Zeng; Shaohang Xu; Zhonghua Liu (2023). PPIP: Automated Software for Identification of Bioactive Endogenous Peptides [Dataset]. http://doi.org/10.1021/acs.jproteome.8b00718.s003

Data from: PPIP: Automated Software for Identification of Bioactive Endogenous Peptides

Related Article
Explore at:
xlsxAvailable download formats
Dataset updated
Jun 1, 2023
Dataset provided by
ACS Publications
Authors
Mingqiang Rong; Baojin Zhou; Ruo Zhou; Qiong Liao; Yong Zeng; Shaohang Xu; Zhonghua Liu
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

Endogenous peptides play an important role in multiple biological processes in many species. Liquid chromatography coupled to tandem mass spectrometry (LC–MS/MS) is an important technique for detecting these peptides on a large scale. We present PPIP, which is a dedicated peptidogenomics software for identifying endogenous peptides based on peptidomics and RNA-Seq data. This software automates the de novo transcript assembly based on RNA-Seq data, construction of a protein reference database based on the de novo assembled transcripts, peptide identification, function analysis, and HTML-based report generation. Different function components are integrated using Docker technology. The Docker image of PPIP is available at https://hub.docker.com/r/shawndp/ppip, and the source code under GPL-3 license is available at https://github.com/Shawn-Xu/PPIP. A user manual of PPIP is available at https://shawn-xu.github.io/PPIP.

Search
Clear search
Close search
Google apps
Main menu