Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
An in-house AMP database including fish AMPs identified in our lab, as well as peptides from the public databases.
The database is classified into two groups, the short AMP group (≤30aa) and the long AMP group (>30). Choose appropriate parameters for blast when searching the database!
Suggestions: For short AMP searching, add -task blastn-short, and -word_size ** -evalue ** For long AMP searching, use defaul or modify the parameters if you think it is needed.
https://spdx.org/licenses/etalab-2.0.htmlhttps://spdx.org/licenses/etalab-2.0.html
Leguminous crops are vital to sustainable agriculture due to their ability to fix atmospheric nitrogen, improving soil fertility and reducing the need for synthetic fertilizers. Additionally, they are an excellent source of protein for both human consumption and animal feed. AntiMicrobial Peptides (AMPs), found in various leguminous seeds, exhibit broad-spectrum antimicrobial activity through diverse mechanisms, including interaction with microbial cell membranes and interference with cellular processes, making them valuable for enhancing crop resilience and food safety. In the field of plant sciences, computational biology methods have been instrumental in the discovery and optimization of AMPs. These methods enable rapid exploration of sequence space and the prediction of AMPs using deep learning technologies. Optimizing AMP annotations through computational design offers a strategic approach to enhance efficacy and minimize potential side effects, providing a viable alternative to conventional antimicrobial agents. However, the presence of overlapping sequences across multiple databases poses a challenge for creating a reliable dataset for AMP prediction. To address this, we conducted a comprehensive analysis of sequence redundancy across various AMP databases. These databases encompass a wide range of AMPs from different sources and with specific functions, including both naturally occurring and artificially synthesized AMPs. Our analysis revealed significant overlap, underscoring the need for a non-redundant AMP sequence database. We present the development of a new database that consolidates unique AMP sequences derived from leguminous seeds, aiming to create a more refined dataset for the binary classification and prediction of plant-derived AMPs. This database will support the advancement of sustainable agricultural practices by enhancing the use of plant-based AMPs in agroecology, contributing to improved crop protection and food security.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Antimicrobial peptides (AMPs) are naturally produced by pro- and eukaryotes and are promising alternatives to antibiotics to fight multidrug-resistant microorganisms. However, despite thousands of AMP entries in respective databases, predictions about their structure–activity relationships are still limited. Similarly, common or dissimilar properties of AMPs that have evolved in different taxonomic groups are nearly unknown. We leveraged data entries for 10,987 peptides currently listed in the three antimicrobial peptide databases APD, DRAMP and DBAASP to aid structure–activity predictions. However, this number reduced to 3,828 AMPs that we could use for computational analyses, due to our stringent quality control criteria. The analysis uncovered a strong bias towards AMPs isolated from amphibians (1,391), whereas only 35 AMPs originate from fungi (0.9%), hindering evolutionary analyses on the origin and phylogenetic relationship of AMPs. The majority (62%) of the 3,828 AMPs consists of less than 40 amino acids but with a molecular weight higher than 2.5 kDa, has a net positive charge and shares a hydrophobic character. They are enriched in glycine, lysine and cysteine but are depleted in glutamate, aspartate and methionine when compared with a peptide set of the same size randomly selected from the UniProt database. The AMPs that deviate from this pattern (38%) can be found in different taxonomic groups, in particular in Gram-negative bacteria. Remarkably, the γ-core motif claimed so far as a unifying structural signature in cysteine-stabilised AMPs is absent in nearly 90% of the peptides, questioning its relevance as a prerequisite for antimicrobial activity. The disclosure of AMPs pattern and their variation in producing organism groups extends our knowledge of the structural diversity of AMPs and will assist future peptide screens in unexplored microorganisms. Structural design of peptide antibiotic drugs will benefit using natural AMPs as lead compounds. However, a reliable and statistically balanced database is missing which leads to a large knowledge gap in the AMP field. Thus, thorough evaluation of the available data, mitigation of biases and standardised experimental setups need to be implemented to leverage the full potential of AMPs for drug development programmes in the clinics and agriculture.
Drugs that have been reported under the Medicaid Drug Rebate Program along with an indication of whether or not the required Average Manufacturer Price (AMP) was reported for each drug. All drugs are identified in the file by the 11-digit National Drug Code, product name, labeler name, and reported (R) or not reported (NR).
This table contains information about transactions. A transaction tracks the movement of parts and materials between store houses and work orders. Each record represents a single transaction. The primary key is TRA_CODE. When TRA_TOENTITY is “EVNT” then you can use the TRA_TOCODE field to join to the Work Orders table on EVT_CODE to know which transactions are associated with each work order. For the User Guide, please follow this link For the Data Dictionary, please follow this link
This table is the primary table for information about work orders, and contains general information - including a description of the work, assigned title, request date and completion date - about each work order. Each row represents a single work order. The primary key field is EVT_CODE. The EVT_OBJECT field can be joined to the Assets table on OBJ_CODE to know which asset the work order was for. For the User Guide, please follow this link For the Data Dictionary, please follow this link
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
# AMP Dataset Documentation
This repository contains several datasets used for conducting ProteoGPT, AMPSorter, AMPGenix and BioToxiPept and datasets conducted with prediction results. Below is a description of each dataset included in this repository.
## Datasets
### 1. `uniprot-compressed_true_download_true_format_fasta_includeIsoform_tr-2022.10.13-02.51.31.70.fasta`
- **Description**: 609,216 non-redundant canonical and isoform protein sequences.
### 2. `protein_seqs_1000.json`
- **Description**: Contains the training data for ProteoGPT.
### 3. `amp_unique_16062.json`
- **Description**: Contains the training data for AMPGenix.
### 4. `AMPSorter&BioToxiPept dataset.xlsx`
- **Description**: Contains the fine-tuning data, including:
- **AMP_data split**: Data used for training, validating and evaluating AMPSorter.
- **AMP_benchmarking Set**: A set of peptides used for benchmarking the AMP models.
- **AMP_external Validation Dataset**: A separate dataset for external model validation for AMPSorter.
- **Toxin_data split**: Data used for training, validating and evaluating BioToxiPept.
- **Toxin_test set**: A set of peptides used for testing the toxin models.
### 5. `NRSPDs`
- **Description**: A large dataset that includes:
- **410,192,277 non-redundant short peptides**.
- **A candidate pool of 82,694,928 peptides**.
- **Logits** with results predicted by AMPsorter and BioToxiPept.
### 6. `GNRSPDs`
- **Description**: Contains:
- **7,798 generated sequences**.
- **A candidate pool of 4,736 peptides**.
- **Logits**with results predicted by AMPsorter and BioToxiPept.
### 7. `196 tested peptides.xlsx`
- **Description**: A set of 196 selected and experimentally tested peptides, with experimentally measured values.
### 8. `20 pilot tested peptides.xlsx`
- **Description**: 20 selected peptides with prediction results and experimentally values measured in pilot test.
### 9. `Sequences generated by different models.xlsx`
- **Description**: Sequences generated by different models with prediction results.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
125917 Global import shipment records of Amp from United States with prices, volume & current Buyer’s suppliers relationships based on actual Global import trade database.
https://choosealicense.com/licenses/other/https://choosealicense.com/licenses/other/
Retargeted Robot Motion Dataset
This dataset provides retargeted motion capture sequences for a variety of robotic platforms. The motion data is derived from the CMU Motion Capture Database and includes a wide range of motion types beyond locomotion — such as gestures, interactions, and full-body activities. The data has been adapted to match the kinematic structure of specific robots, enabling its use in tasks such as:
Imitation learning Reinforcement learning Motion analysis… See the full description on the dataset page: https://huggingface.co/datasets/ami-iit/amp-dataset.
The antimicrobial peptides (AMP) have been proposed as an alternative to control resistant pathogens. However, due to multifunctional properties of several AMP classes, until now there has been no way to perform efficient AMP identification, except through in vitro and in vivo tests. Nevertheless, an indication of activity can be provided by prediction methods. In order to contribute to the AMP prediction field, the CS-AMPPred (Cysteine-Stabilized Antimicrobial Peptides Predictor) is presented here, consisting of an updated version of the Support Vector Machine (SVM) model for antimicrobial activity prediction in cysteine-stabilized peptides. The CS-AMPPred is based on five sequence descriptors: indexes of (i) ?-helix and (ii) loop formation; and averages of (iii) net charge, (iv) hydrophobicity and (v) flexibility. CS-AMPPred was based on 310 cysteine-stabilized AMPs and 310 sequences extracted from PDB. The polynomial kernel achieves the best accuracy on 5-fold cross validation (85.81%), while the radial and linear kernels achieve 84.19%. Testing in a blind data set, the polynomial and radial kernels achieve an accuracy of 90.00%, while the linear model achieves 89.33%. The three models reach higher accuracies than previously described methods. A standalone version of CS-AMPPred is available for download at and runs on any Linux machine.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
AMP reported AUD1.68B in Sales Revenues for its fiscal semester ending in June of 2024. Data for AMP - Sales Revenues including historical, tables and charts were last updated by Trading Economics this last July in 2025.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
AMP reported AUD1.54B in Operating Expenses for its fiscal semester ending in June of 2024. Data for AMP - Operating Expenses including historical, tables and charts were last updated by Trading Economics this last July in 2025.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
153 Global import shipment records of Amps with prices, volume & current Buyer's suppliers relationships based on actual Global export trade database.
Computed materials data using density functional theory calculations. These calculations determine the electronic structure of bulk materials by solving approximations to the Schrodinger equation. For more information, see https://materialsproject.org/docs/calculations
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
AMP reported 4.46 in Dividend Yield for its fiscal semester ending in June of 2024. Data for AMP - Dividend Yield including historical, tables and charts were last updated by Trading Economics this last July in 2025.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
AMP reported AUD27.78B in Debt for its fiscal semester ending in June of 2024. Data for AMP - Debt including historical, tables and charts were last updated by Trading Economics this last July in 2025.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The Triticeae Toolbox Oat (T3 Oat) is the repository of oat phenotype and genotype data for the Global Oat Genetics Database.
The Triticeae Toolbox (T3) is the web portal for data generated by the Triticeae Coordinated Agricultural Project (T-CAP), funded by the National Institute for Food and Agriculture (NIFA) of the United States Department of Agriculture (USDA). It also contains data from US Uniform Regional Nurseries, supported by the US Wheat and Barley Scab Initiative. The database was initially developed as The Hordeum Toolbox (THT) to hold barley data generated by the Barley CAP project (2006-2010). T3 Barley hods data generated for Hordeum vulgare L. T3 Wheat holds data generated for Triticum spp. T3 Oat holds data generated for Avena. All are being enhanced in database performance, community curation and user tools. T3 contains germplasm line information, pedigree, genotype and phenotypic data from breeding programs participating in the CAP and core germplasm collections maintained by the USDA National Small Grains Collection.
Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
AMP-deaminase (AMPD) ([ec:3.5.4.6]) is a large, well-conserved eukaryotic protein that catalyses the hydrolytic deamination of adenosine monophosphate (AMP) to inosine monophosphate (IMP), and so plays an important role in purine and energy metabolism . This entry also includes inactive deaminases from yeast, which lack the conserved His residues essential for binding the catalytic zinc ion and the conserved residues important for substrate binding and catalysis.
This dataset provides the number, surface area, and volume concentrations and size distributions of dry aerosol particles measured by the Aerosol Microphysical Properties (AMP) instrument package during airborne campaigns conducted by NASA's Atmospheric Tomography (ATom) mission. Five instruments--two nucleation-mode aerosol size spectrometers (NMASS), two ultra-high sensitivity aerosol spectrometers (UHSAS), and a laser aerosol spectrometer (LAS)--comprise the AMP package. The AMP payload provides size distributions with up to one-second time resolution for dry aerosol particles between 0.003 and 4.8 microns in diameter.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
An in-house AMP database including fish AMPs identified in our lab, as well as peptides from the public databases.
The database is classified into two groups, the short AMP group (≤30aa) and the long AMP group (>30). Choose appropriate parameters for blast when searching the database!
Suggestions: For short AMP searching, add -task blastn-short, and -word_size ** -evalue ** For long AMP searching, use defaul or modify the parameters if you think it is needed.