Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Endogenous peptides play an important role in multiple biological processes in many species. Liquid chromatography coupled to tandem mass spectrometry (LC–MS/MS) is an important technique for detecting these peptides on a large scale. We present PPIP, which is a dedicated peptidogenomics software for identifying endogenous peptides based on peptidomics and RNA-Seq data. This software automates the de novo transcript assembly based on RNA-Seq data, construction of a protein reference database based on the de novo assembled transcripts, peptide identification, function analysis, and HTML-based report generation. Different function components are integrated using Docker technology. The Docker image of PPIP is available at https://hub.docker.com/r/shawndp/ppip, and the source code under GPL-3 license is available at https://github.com/Shawn-Xu/PPIP. A user manual of PPIP is available at https://shawn-xu.github.io/PPIP.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Most of the existing computational tools for de novo library design are focused on the generation, rational selection, and combination of promising structural motifs to form members of the new library. However, the absence of a direct link between the chemical space of the retrosynthetically generated fragments and the pool of available reagents makes such approaches appear as rather theoretical and reality-disconnected. In this context, here we present Synthons Interpreter (SynthI), a new open-source toolkit for de novo library design that allows merging those two chemical spaces into a single synthons space. Here synthons are defined as actual fragments with valid valences and special labels, specifying the position and the nature of reactive centers. They can be issued from either the “breakup” of reference compounds according to 38 retrosynthetic rules or real reagents, after leaving group withdrawal or transformation. Such an approach not only enables the design of synthetically accessible libraries and analog generation but also facilitates reagents (building blocks) analysis in the medicinal chemistry context. SynthI code is publicly available at https://github.com/Laboratoire-de-Chemoinformatique/SynthI.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The continuous improvement of computer architectures allows for the simulation of molecular systems of growing sizes. However, such calculations still require the input of initial structures, which are also becoming increasingly complex. In this work, we present CAT, a Compound Attachment Tool (source code available at https://github.com/nlesc-nano/CAT) and Python package for the automatic construction of composite chemical compounds, which supports the functionalization of organic, inorganic, and hybrid organic–inorganic materials. The CAT workflow consists in defining the anchoring sites on the reference material, usually a large molecular system denoted as a scaffold, and on the molecular species that are attached to it, i.e., the ligands. Usually, ligands are pre-optimized in a conformation biased toward more linear structures to minimize interligand(s) steric interactions, a bias that is important when multiple ligands are attached onto the scaffold. The resulting superstructure(s) are then stored in various formats that can be used afterward in quantum chemical calculations or classical force field-based simulations.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Publicly available geocoded social determinants of health and mobility datasets used in the analysis of "Chronic Acid Suppression and Social Determinants of COVID-19 Infection".These datasets are required for the analytical workflow shared on Github which demonstrates how the analysis in the manuscript was done using randomly generated samples to protect patient privacy.zcta_county_rel_10.txt - Population and housing density from the 2010 decennial census. Obtained from: https://www2.census.gov/geo/docs/maps-data/data/rel/zcta_county_rel_10.txtcre-2018-a11.csv - Community Resilience Estimates which is is the capacity of individuals and households to absorb, endure, and recover from the health, social, and economic impacts of a disaster such as a hurricane or pandemic. Data obtained from: https://www.census.gov/data/experimental-data-products/community-resilience-estimates.htmlzcta_tract_rel_10.txt - Relationship between ZCTA and US Census tracts (used to map census tracts to ZCTA). Data obtained from: https://www.census.gov/geographies/reference-files/time-series/geo/relationship-files.html#par_textimage_674173622mask-use-by-county.txt - Mask Use By County comes from a large number of interviews conducted online by the global data and survey firm Dynata at the request of The New York Times. The firm asked a question about mask use to obtain 250,000 survey responses between July 2 and July 14, enough data to provide estimates more detailed than the state level. Data obtained from: https://github.com/nytimes/covid-19-data/tree/master/mask-usemobility_report_US.txt - Google mobility report which charts movement trends over time by geography, across different categories of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential. Data obtained from: https://github.com/ActiveConclusion/COVID19_mobility/blob/master/google_reports/mobility_report_US.csvACS2015_zctaallvars.csv - Social Deprivation Index is a composite measure of area level deprivation based on seven demographic characteristics collected in the American Community Survey (https://www.census.gov/programs-surveys/acs/) and used to quantify the socio-economic variation in health outcomes. Factors are: Income, Education, Employment, Housing, Household Characteristics, Transportation, Demographics. Data obtained from: https://www.graham-center.org/rgc/maps-data-tools/sdi/social-deprivation-index.html
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
One persistent challenge in untargeted metabolomics is the identification of compounds from their mass spectrometry (MS) signal, which is necessary for biological data interpretation. This process can be facilitated by building in-house libraries of metabolite standards containing retention time (RT) information, which is orthogonal and complementary to large, published MS/MS spectra repositories. Creating such libraries can require substantial effort and is time intensive. To streamline this process, we developed metScribeR, an R package with a Shiny application to accelerate the creation of RT and m/z libraries. metScribeR provides an easy, user-friendly interface for peak finding, filtering, and comprehensive quality review of the MS data. Uniquely, metScribeR does not require MS/MS spectral information and reports an identification probability estimate for each adduct. In our benchmarking, metScribeR required approximately 10 s of computational and manual effort per standard, showed a correlation of 0.99 between manual and metScribeR-derived RTs, and appropriately filtered out poor quality peaks. The metScribeR output is a.csv file including the identity, m/z, RT, and peak quality information for standards along with MS/MS spectra retrieved from MassBank of North America (MoNA). metScribeR is open source and available for download on GitHub at https://github.com/ncats/metScribeR
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Endogenous peptides play an important role in multiple biological processes in many species. Liquid chromatography coupled to tandem mass spectrometry (LC–MS/MS) is an important technique for detecting these peptides on a large scale. We present PPIP, which is a dedicated peptidogenomics software for identifying endogenous peptides based on peptidomics and RNA-Seq data. This software automates the de novo transcript assembly based on RNA-Seq data, construction of a protein reference database based on the de novo assembled transcripts, peptide identification, function analysis, and HTML-based report generation. Different function components are integrated using Docker technology. The Docker image of PPIP is available at https://hub.docker.com/r/shawndp/ppip, and the source code under GPL-3 license is available at https://github.com/Shawn-Xu/PPIP. A user manual of PPIP is available at https://shawn-xu.github.io/PPIP.