Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Target store USA products dataset having 168 records with 24 datapoint.
Datapoints: title url brand main_image sku description raw_description gtin13 currency price availability availableDeliveryMethod available_branch primary_category sub_category_1 sub_category_2 sub_category_3 images raw_specifications specifications highlights raw_highlights uniq_id scraped_at
Get complete dataset from crawl feeds over more than 800K+ records
Facebook
TwitterIt is a dual function database that associates an informatics database to a structural database of known and potential drug targets. PDTD is a comprehensive, web-accessible database of drug targets, and focuses on those drug targets with known 3D-structures. PDTD contains 1207 entries covering 841 known and potential drug targets with structures from the Protein Data Bank (PDB). Drug targets of PDTD were categorized into 15 and 13 types according to two criteria: therapeutic areas and biochemical criteria. The database supports extensive searching function using PDB ID, target name and category, related disease.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Founded in 1902, Target has grown to become the second-largest discount retailer in the United States. The company currently has 1829 locations throughout the United States
This dataset includes a record for Target location currently in operation as of April 2017. Columns include location data (address, Lat/Lon), store open date, last remodel date, capabilities (integrated Starbucks, CVS, etc.) and several other interesting data points.
I utilized Python’s Requests library to scrape this data from Target’s store locator.
I enjoy plotting geographical data! When I was looking through the data that Target’s store locator returned, I thought others might find it useful, and decided to scrape and clean it into a useful dataset.
Facebook
TwitterToxin and Toxin Target Database (T3DB) is a bioinformatics resource that combines detailed toxin data with comprehensive toxin target information.
Facebook
TwitterA database of oncogenes and tumor suppressor genes. Users can search by genes, chromosomes, and keywords. The coAnsensus domain analysis tool functions to identify conserved protein domains and GO terms among selected TAG genes, while the “oncogenic domain analysis” can analyze oncogenic potential of any user-provided protein based on a weighed term frequency table calculated from the TAG proteins. The completion of human genome sequences allows one to rapidly identify and analyze genes of interest through the use of computational approach. The available annotations including physical characterization and functional domains of known tumor-related genes thus can be used to study the role of genes involved in carcinogenesis. The tumor-associated gene (TAG) database was designed to utilize information from well-characterized oncogenes and tumor suppressor genes to facilitate cancer research. All target genes were identified through text-mining approach from the PubMed database. A semi-automatic information retrieving engine was built to collect specific information of these target genes from various resources and store in the TAG database. At current stage, 519 TAGs including 198 oncogenes, 170 tumor suppressor genes, and 151 genes related to oncogenesis were collected. Information collected in TAG database can be browsed through user-friendly web interfaces that provide searching genes by chromosome or by keywords. The “consensus domain analysis” tool functions to identify conserved protein domains and GO terms among selected TAG genes. In addition, the “oncogenic domain analysis” can analyze oncogenic potential of any user-provided protein based on a weighed term frequency table calculated from the TAG proteins. This study was supported by grant from National research program for genomic medicine (NRPGM) and personnel from Bioinformatics Center of Center for Biotechnology and Biosciences in the National Cheng Kung University, Taiwan.
Facebook
TwitterInformation for how to cite the MTE bundle.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Comprehensive dataset containing 598 verified Target Optical locations in United States with complete contact information, ratings, reviews, and location data.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Drug repositioning has shorter developmental time, lower cost and less safety risk than traditional drug development process. The current study aims to repurpose marketed drugs and clinical candidates for new indications in diabetes treatment by mining clinical ‘omics’ data. We analyzed data from genome wide association studies (GWAS), proteomics and metabolomics studies and revealed a total of 992 proteins as potential anti-diabetic targets in human. Information on the drugs that target these 992 proteins was retrieved from the Therapeutic Target Database (TTD) and 108 of these proteins are drug targets with drug projects information. Research and preclinical drug targets were excluded and 35 of the 108 proteins were selected as druggable proteins. Among them, five proteins were known targets for treating diabetes. Based on the pathogenesis knowledge gathered from the OMIM and PubMed databases, 12 protein targets of 58 drugs were found to have a new indication for treating diabetes. CMap (connectivity map) was used to compare the gene expression patterns of cells treated by these 58 drugs and that of cells treated by known anti-diabetic drugs or diabetes risk causing compounds. As a result, 9 drugs were found to have the potential to treat diabetes. Among the 9 drugs, 4 drugs (diflunisal, nabumetone, niflumic acid and valdecoxib) targeting COX2 (prostaglandin G/H synthase 2) were repurposed for treating type 1 diabetes, and 2 drugs (phenoxybenzamine and idazoxan) targeting ADRA2A (Alpha-2A adrenergic receptor) had a new indication for treating type 2 diabetes. These findings indicated that ‘omics’ data mining based drug repositioning is a potentially powerful tool to discover novel anti-diabetic indications from marketed drugs and clinical candidates. Furthermore, the results of our study could be related to other disorders, such as Alzheimer’s disease.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The dataset contains input and target variables for Waste-to-DME plant scenarios that were used to develop artificial neural networks. Applied waste types: (1) the organic fraction of municipal solid waste (OFMSW), (2) sewage sludge (SS).
Facebook
TwitterUnconventional epitopes presented by HLA class I complexes are emerging targets for T cell targeted immunotherapies. Their identification by mass spectrometry required development of novel methods to cope with the large number of theoretical candidates. Methods to identify post-translationally spliced peptides led to a broad range of outcomes. We here investigated the impact of three common database search engines – i.e. Mascot, Mascot+Percolator and PEAKS DB – as final identification step, as well as the features of target database on the ability to correctly identify non-spliced and cis-spliced peptides. We used ground truth datasets measured by mass spectrometry to benchmark methods’ performance and extended the analysis to HLA class I immunopeptidomes. PEAKS DB showed better precision and recall of cis-spliced peptides and larger number of identified peptides in HLA class I immunopeptidomes than the other search engine strategies. The better performance of PEAKS DB appears to result from better discrimination between target and decoy hits and hence a more robust FDR estimation, and seems independent to peptide and spectrum features here investigated. Head of the research group Molecular Immunology at King’s College London and the Francis Crick Institute, London (UK). Email: michele.mishto@kcl.ac.uk,
Facebook
TwitterSummary of MER contact science (CS) activities and targets. Localization information in Site Frame and within identifying images are provided where information was available. Contact science refers to analyses conducted by instruments deployed using the Instrument Deployment Device (IDD).
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Protein Structure Initiative - TargetTrack protein target registration database (795 MB, gzipped tarball)
The Protein Structure Initiative was a high-throughput structural genomics effort from 2000-2015 focused on developing technologies to enable greater coverage of protein structure space. Over its 15-year tenure, over 100 investigators at 35 centers (see ContributingCenters.xls) declared over 350,000 protein sequences (targets) that they would study using state-of-the-art protein production and structure determination methods. Many of these targets were selected through bioinformatics-based methods to serve as representatives for sequence and structure clusters.
From 2003-2010, these selected sequences and some basic identifying metadata were kept in a database called TargetDB, created at the Research Collaboratory for Structural Bioinformatics at Rutgers University. In 2008, a second database named PepcDB was created to track detailed experimental trial history and the standard protocols used by the PSI centers. These two databases became the principal structural genomics target databases, and were rolled into the PSI Structural Biology Knowledgebase in 2008.
As part of the third phase of the PSI, TargetDB and PepcDB were merged into a single resource, TargetTrack, to facilitate one-stop access to the data as well as expanding the schema to include new required data items. Participating centers deposited the latest status on their active targets and the protocols that were used (along with any deviations) on a weekly or quarterly basis. TargetTrack provided a variety of pre-computed data downloads on a weekly basis as well.
In July 2017, the Structural Biology Knowledgebase ceased operations. The files provided in this tarball represent the final datafiles generated by TargetTrack (timestamp June 30, 2017). Please read the README included in this dataset for descriptions of each file.
The entire TargetTrack datafile in XML format can be found in /TargetTrack XML files/tt.xml.gz
Key documentation can be found in the /Documentation folder.
TargetTrack schema: targetTrack-v1.4.1.pdf
Spreadsheet with TargetTrack enumerations for relevant fields: targetTrackEnumeratedDataItems-v1.4.1-1.xls
Image depicted the XML data schema: targetTrack-v1.4.1.jpg
These files are 868 MB in total size, uncompressed.
To open the tarball, use the command 'tar -zxvf TargetTrack-1Jul2017.tar.gz'
-- created by the PSI Structural Biology Knowledgebase, July 5, 2017
Facebook
TwitterThe DrugBank database is a bioinformatics and chemoinformatics resource that combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information. This collection references target information from version 4 of the database.
Facebook
Twitterhttps://spdx.org/licenses/etalab-2.0.htmlhttps://spdx.org/licenses/etalab-2.0.html
MassiveFold data generated for CASP16, in collaboration with CAPRI . 8040 predictions were generated for the target = 1005 x 8 sets of parameters. - a description of the setup can be found in MassiveFold_CASP16_Abstract.pdf - README.txt describes the contents of the main MassiveFold.tar.gz file - the main MassiveFold.tar.gz file contains all the predictions, divided into 8 folders named after the conditions. It contains predictions as well as pickle files, sequence alignments, rankings and plots. The README.txt file describes the contents of this tar.gz. - theonly_pdbs_MassiveFold.tar.gzis the result of thegather_runs.pyscript, without the pickle files. It contains a list of all the pdb files and ranking files with scores. -gather_runs.pyallows to gather the runs, to use preferentially to the one included in the mainMassiveFold.tar.gzat the time of the prediction phase, because it has been updated -combined_scores.csv` file contains the CASP assessment for the target (from https://predictioncenter.org/)
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Comprehensive dataset containing 52 verified Target Distribution Center locations in United States with complete contact information, ratings, reviews, and location data.
Facebook
TwitterCREB target gene database that uses a multi-layered approach to predict, validate and characterize CREB target genes. For each gene, the database tries to provide the following information: 1. CREB binding sites on the promoters 2. Promoter occupancy by CREB 3. Gene activation by cAMP in tissues CREB seems to occupy a large number of promoters in the genome (up to ~5000 in human), and the profiles for CREB promoter occupancy are very similar in different human tissues. However, only a small proportion of CREB occupied genes are induced by cAMP in any cell type, possibly reflecting the requirement of additional regulatory partners that assist in recruitment of the transcriptional apparatus. To use the database, choose the species, select the table you want to search, leave field (''All'') and type in the gene you want to search. A table listing the search results will be returned, followed by the description of the table. If no search result is returned, try the official gene symbol or gene ID (locuslink number) from NCBI Entrez Gene to search. Sponsors: This work was supported by National Institutes of Health Grants GM RO1-037828 (to M.M.) and DK068655 (to R.A.Y.).
Facebook
TwitterTarget fishing is the process of identifying the protein target of a bioactive small molecule. To do so experimentally requires a significant investment of time and resources, which can be expedited with a reliable computational target fishing model. The development of computational target fishing models using machine learning has become very popular over the last several years because of the increased availability of large amounts of public bioactivity data. Unfortunately, the applicability and performance of such models for natural products has not yet been comprehensively assessed. This is, in part, due to the relative lack of bioactivity data available for natural products compared to synthetic compounds. Moreover, the databases commonly used to train such models do not annotate which compounds are natural products, which makes the collection of a benchmarking set difficult. To address this knowledge gap, a data set composed of natural product structures and their associated protein targets was generated by cross-referencing 20 publicly available natural product databases with the bioactivity database ChEMBL. This data set contains 5589 compound–target pairs for 1943 unique compounds and 1023 unique targets. A synthetic data set comprising 107 190 compound–target pairs for 88 728 unique compounds and 1907 unique targets was used to train k-nearest neighbors, random forest, and multilayer perceptron models. The predictive performance of each model was assessed by stratified 10-fold cross-validation and benchmarking on the newly collected natural product data set. Strong performance was observed for each model during cross-validation with area under the receiver operating characteristic (AUROC) scores ranging from 0.94 to 0.99 and Boltzmann-enhanced discrimination of receiver operating characteristic (BEDROC) scores from 0.89 to 0.94. When tested on the natural product data set, performance dramatically decreased with AUROC scores ranging from 0.70 to 0.85 and BEDROC scores from 0.43 to 0.59. However, the implementation of a model stacking approach, which uses logistic regression as a meta-classifier to combine model predictions, dramatically improved the ability to correctly predict the protein targets of natural products and increased the AUROC score to 0.94 and BEDROC score to 0.73. This stacked model was deployed as a web application, called STarFish, and has been made available for use to aid in target identification for natural products.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Drug target proteins identified from Drugbank and Therapeutic target database.
Facebook
TwitterCentral database to house data on morpholino screens currently containing over 700 morpholinos including control and multiple morpholinos against the same target. A publicly accessible sequence-based search opens this database for morpholinos against a particular target for the zebrafish community. Morpholino Screens: They set out to identify all cotranslationally translocated genes in the zebrafish genome (Secretome/CTT-ome). Morpholinos were designed against putative secreted/CTT targets and injected into 1-4 cell stage zebrafish embryos. The embryos were observed over a 5 day period for defects in several different systems. The first screen examined 184 gene targets of which 26 demonstrated defects of interest (Pickart et al. 2006). A collaboration with the Verfaillie laboratory examined the knockdown of targets identified in a comparative microarray analysis of hematopoietic stem cells demonstrating how microarray and morpholino technologies can be used in conjunction to enrich for defects in specific developmental processes. Currently, many collaborations are underway to identify genes involved in morphological, kidney, skin, eye, pigment, vascular and hematopoietic development, lipid metabolism and more. The screen types referred to in the search functions are the specific areas of development that were examined during the various screens, which include behavior, general morphology, pigmentation, toxicity, Pax2 expression, and development of the craniofacial structures, eyes, kidneys, pituitary, and skin. Only data pertaining to specific tests performed are presented. Due to the complexity of this international collaboration and time constraints, not all morpholinos were subjected to all screen types. They are currently expanding public access to the database. In the future we will provide: * Mortality curves and dose range for each morpholino * Preliminary data regarding the effectiveness of each morpholino * Expanded annotation for each morpholino * External linkage of our morpholino sequences to ZFIN and Ensembl. To submit morpholino-knockdown results to MODB please contact the administrator for a user name and password.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
COlleCtion of Open NatUral producTs (COCONUT) is an aggregated dataset comprising elucidated and predicted natural products (NPs) from open repositories. It offers a user-friendly web interface for browsing, searching, and efficiently downloading NPs. The latest database integrates more than 63 open NP resources, providing unrestricted access to data free of charge. Each entry in the database represents a "flat" NP structure, accompanied by information on its known stereochemical forms, relevant literature, producing organisms, natural geographical distribution, and precomputed molecular properties.
Natural products are small bioactive molecules produced by living organisms with potential applications in pharmacology and various industries. The significance of these compounds has driven global interest in NP research across diverse fields. However, despite the growing number of general and specialized NP databases, no comprehensive online resource has consolidated all known NPs in one place—until COCONUT. This became a resource facilitating NP research, enabling computational screening and other in-silico applications.
| Total Molecules | Total Collections | Unique Organisms | Citations Mapped |
|---|---|---|---|
| 621,631 | 63 | 55,252 |
24,272 |
| S.No | Database name | Entries integrated in COCONUT | Latest resource URL |
|---|---|---|---|
| 1 | AfroCancer | 390 | Fidele Ntie-Kang, Justina Ngozi Nwodo, Akachukwu Ibezim, Conrad Veranso Simoben, Berin Karaman, Valery Fuh Ngwa, Wolfgang Sippl, Michael Umale Adikwu, and Luc Meva’a Mbaze Journal of Chemical Information and Modeling 2014 54 (9), 2433-2450 https://doi.org/10.1021/ci5003697 |
| 2 | AfroDB | 953 | Fidele Ntie-Kang ,Denis Zofou,Smith B. Babiaka,Rolande Meudom,Michael Scharfe,Lydia L. Lifongo,James A. Mbah,Luc Meva’a Mbaze,Wolfgang Sippl,Simon M. N. Efange https://doi.org/10.1371/journal.pone.0078085 |
| 3 | AfroMalariaDB | 265 | Onguéné, P.A., Ntie-Kang, F., Mbah, J.A. et al. The potential of anti-malarial compounds derived from African medicinal plants, part III: an in silico evaluation of drug metabolism and pharmacokinetics profiling. Org Med Chem Lett 4, 6 (2014). https://doi.org/10.1186/s13588-014-0006-x |
| 4 | AnalytiCon Discovery NPs | 5,147 | Natural products are a sebset of AnalytiCon Discovery NPs https://ac-discovery.com/screening-libraries/ |
| 5 | BIOFACQUIM | 605 | Pilón-Jiménez, B.A.; Saldívar-González, F.I.; Díaz-Eufracio, B.I.; Medina-Franco, J.L. BIOFACQUIM: A Mexican Compound Database of Natural Products. Biomolecules 2019, 9, 31. https://doi.org/10.3390/biom9010031 |
| 6 | BitterDB | 685 | Ayana Dagan-Wiener, Antonella Di Pizio, Ido Nissim, Malkeet S Bahia, Nitzan Dubovski, Eitan Margulis, Masha Y Niv, BitterDB: taste ligands and receptors database in 2019, Nucleic Acids Research, Volume 47, Issue D1, 08 January 2019, Pages D1179–D1185, https://doi.org/10.1093/nar/gky974 |
| 7 | Carotenoids Database | 1,195 | Junko Yabuzaki, Carotenoids Database: structures, chemical fingerprints and distribution among organisms, Database, Volume 2017, 2017, bax004, https://doi.org/10.1093/database/bax004 |
| 8 | ChEBI NPs | 16,215 | Janna Hastings, Paula de Matos, Adriano Dekker, Marcus Ennis, Bhavana Harsha, Namrata Kale, Venkatesh Muthukrishnan, Gareth Owen, Steve Turner, Mark Williams, Christoph Steinbeck, The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013, Nucleic Acids Research, Volume 41, Issue D1, 1 January 2013, Pages D456–D463, https://doi.org/10.1093/nar/gks1146 |
| 9 | ChEMBL NPs | 1,910 | Anna Gaulton, Anne Hersey, Michał Nowotka, A. Patrícia Bento, Jon Chambers, David Mendez, Prudence Mutowo, Francis Atkinson, Louisa J. Bellis, Elena Cibrián-Uhalte, Mark Davies, Nathan Dedman, Anneli Karlsson, María Paula Magariños, John P. Overington, George Papadatos, Ines Smit, Andrew R. Leach, The ChEMBL database in 2017, Nucleic Acids Research, Volume 45, Issue D1, January 2017, Pages D945–D954, https://doi.org/10.1093/nar/gkw1074 |
| 10 | ChemSpider NPs | 9,740 | Harry E. Pence and Antony Williams Journal of Chemical Education 2010 87 (11), 1123-1124 https://doi.org/10.1021/ed100697w |
| 11 | CMAUP (cCollective molecular activities of useful plants) | 47,593 | Xian Zeng, Peng Zhang, Yali Wang, Chu Qin, Shangying Chen, Weidong He, Lin Tao, Ying Tan, Dan Gao, Bohua Wang, Zhe Chen, Weiping Chen, Yu Yang Jiang, Yu Zong Chen, CMAUP: a database of collective molecular activities of useful plants, Nucleic Acids Research, Volume 47, Issue D1, 08 January 2019, Pages D1118–D1127, https://doi.org/10.1093/nar/gky965 |
| 12 | ConMedNP | 3,111 | DOI https://doi.org/10.1039/C3RA43754J |
| 13 | ETM (Ethiopian Traditional Medicine) DB | 1,798 | Bultum, L.E., Woyessa, A.M. & Lee, D. ETM-DB: integrated Ethiopian traditional herbal medicine and phytochemicals database. BMC Complement Altern Med 19, 212 (2019). https://doi.org/10.1186/s12906-019-2634-1 |
| 14 | Exposome-explorer | 434 | Vanessa Neveu, Alice Moussy, Héloïse Rouaix, Roland Wedekind, Allison Pon, Craig Knox, David S. Wishart, Augustin Scalbert, Exposome-Explorer: a manually-curated database on biomarkers of exposure to dietary and environmental factors, Nucleic Acids Research, Volume 45, Issue D1, January 2017, Pages D979–D984, https://doi.org/10.1093/nar/gkw980 |
| 15 | FoodDB | 70,385 | Natural products are a sebset of FoodDB https://foodb.ca/ |
| 16 | GNPS (Global Natural Products Social Molecular Networking) | 11,103 | Wang, M., Carver, J., Phelan, V. et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat Biotechnol 34, 828–837 (2016). https://doi.org/10.1038/nbt.3597 |
| 17 | HIM (Herbal Ingredients in-vivo Metabolism database) | 1,259 | Kang, H., Tang, K., Liu, Q. et al. HIM-herbal ingredients in-vivo metabolism database. J Cheminform 5, 28 (2013). https://doi.org/10.1186/1758-2946-5-28 |
| 18 | HIT (Herbal Ingredients Targets) | 530 | Hao Ye, Li Ye, Hong Kang, Duanfeng Zhang, Lin Tao, Kailin Tang, Xueping Liu, Ruixin Zhu, Qi Liu, Y. Z. Chen, Yixue Li, Zhiwei Cao, HIT: linking herbal active ingredients to targets, Nucleic Acids Research, Volume 39, Issue suppl_1, 1 January 2011, Pages D1055–D1059, https://doi.org/10.1093/nar/gkq1165 |
| 19 | Indofine Chemical Company | 46 | Natural products are a sebset of Indofine Chemical Company https://indofinechemical.com/ |
| 20 | InflamNat | 664 | Ruihan Zhang, Jing Lin, Yan Zou, Xing-Jie Zhang, and Wei-Lie Xiao Journal of Chemical Information and Modeling 2019 59 (1), 66-73 DOI: 10.1021/acs.jcim.8b00560 <a href="https://doi.org/10.1021/acs.jcim.8b00560" target="_blank" rel="noopener |
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Target store USA products dataset having 168 records with 24 datapoint.
Datapoints: title url brand main_image sku description raw_description gtin13 currency price availability availableDeliveryMethod available_branch primary_category sub_category_1 sub_category_2 sub_category_3 images raw_specifications specifications highlights raw_highlights uniq_id scraped_at
Get complete dataset from crawl feeds over more than 800K+ records