5 datasets found
  1. f

    DataSheet1_Training machine learning models with synthetic data improves the...

    • datasetcatalog.nlm.nih.gov
    • frontiersin.figshare.com
    Updated Aug 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sebastian, Rafael; Camara, Oscar; Mont, Lluis; Penela, Diego; Doste, Ruben; Lozano, Miguel; Berruezo, Antonio; Jimenez-Perez, Guillermo (2022). DataSheet1_Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias.PDF [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000449397
    Explore at:
    Dataset updated
    Aug 12, 2022
    Authors
    Sebastian, Rafael; Camara, Oscar; Mont, Lluis; Penela, Diego; Doste, Ruben; Lozano, Miguel; Berruezo, Antonio; Jimenez-Perez, Guillermo
    Description

    In order to determine the site of origin (SOO) in outflow tract ventricular arrhythmias (OTVAs) before an ablation procedure, several algorithms based on manual identification of electrocardiogram (ECG) features, have been developed. However, the reported accuracy decreases when tested with different datasets. Machine learning algorithms can automatize the process and improve generalization, but their performance is hampered by the lack of large enough OTVA databases. We propose the use of detailed electrophysiological simulations of OTVAs to train a machine learning classification model to predict the ventricular origin of the SOO of ectopic beats. We generated a synthetic database of 12-lead ECGs (2,496 signals) by running multiple simulations from the most typical OTVA SOO in 16 patient-specific geometries. Two types of input data were considered in the classification, raw and feature ECG signals. From the simulated raw 12-lead ECG, we analyzed the contribution of each lead in the predictions, keeping the best ones for the training process. For feature-based analysis, we used entropy-based methods to rank the obtained features. A cross-validation process was included to evaluate the machine learning model. Following, two clinical OTVA databases from different hospitals, including ECGs from 365 patients, were used as test-sets to assess the generalization of the proposed approach. The results show that V2 was the best lead for classification. Prediction of the SOO in OTVA, using both raw signals or features for classification, presented high accuracy values (>0.96). Generalization of the network trained on simulated data was good for both patient datasets (accuracy of 0.86 and 0.84, respectively) and presented better values than using exclusively real ECGs for classification (accuracy of 0.84 and 0.76 for each dataset). The use of simulated ECG data for training machine learning-based classification algorithms is critical to obtain good SOO predictions in OTVA compared to real data alone. The fast implementation and generalization of the proposed methodology may contribute towards its application to a clinical routine.

  2. Success.ai | EU Company Data | Enrichment APIs | 28M+ Full Company Profiles...

    • datarade.ai
    Updated Oct 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2024). Success.ai | EU Company Data | Enrichment APIs | 28M+ Full Company Profiles & Contact Data – Best Price & Quality Guarantee [Dataset]. https://datarade.ai/data-products/success-ai-eu-company-data-enrichment-apis-28m-full-co-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Oct 20, 2024
    Dataset provided by
    Area covered
    Philippines, Hong Kong, Haiti, Benin, Tunisia, Turkmenistan, Iran (Islamic Republic of), Tuvalu, Cabo Verde, Faroe Islands
    Description

    Success.ai offers a powerful platform for accessing extensive EU company data, designed to meet the dynamic marketing and advertising needs across diverse industries. This specialized dataset includes detailed profiles of over 28 million companies, from burgeoning startups to established private firms, tailored to support precise data enrichment and targeted marketing.

    Enrichment API Capabilities:

    • Seamless Data Integration: Utilize our enrichment APIs to integrate and update your systems with real-time data, enhancing data accuracy and utility.
    • Custom API Solutions: Tailor API services to your specific needs, ensuring you receive the most relevant data for your business initiatives.

    Key Benefits:

    • Diverse Data Collection: Our comprehensive database covers a broad spectrum of EU company data, offering rich insights for varied marketing strategies.
    • Tailored for Marketing Excellence: Maximize engagement and ROI in your email marketing, B2B marketing, and advertising campaigns with high-quality, targeted data.
    • Global Reach with Local Insights: Our data spans across the EU, providing detailed insights tailored to both global strategies and localized campaigns.
    • Actionable Insights for Strategic Marketing: Refine your marketing tactics with precise data, reaching the right audience with the right message.

    Key Use Cases Leveraged by Success.ai:

    • Data Enrichment: Enhance your database quality by integrating detailed EU company data, ensuring accuracy and relevance in your marketing efforts.
    • Email Marketing: Utilize refined data to craft personalized email campaigns that resonate with your target audience, driving better engagement and conversions.
    • B2B Marketing: Access detailed EU company profiles to design bespoke B2B marketing strategies that reach decision-makers effectively.
    • Advertising: Leverage precise company insights to optimize ad targeting, maximizing your advertising spend and improving campaign performance.
    • Comprehensive Marketing Support: Utilize our data to support a wide range of marketing activities, from lead generation to brand awareness campaigns.

    Why Choose Success.ai?

    • Best Price & Quality Guarantee: We ensure you receive the highest value for your data investment, beating any competitor’s pricing.
    • Advanced Validation Techniques: Our datasets are verified through sophisticated AI algorithms, ensuring a 99% accuracy rate.
    • Customized Data Solutions: Receive datasets tailored to your specific needs, from broad market trends to niche market data.
    • Compliance and Integrity: All our data practices adhere to GDPR and other international regulations, ensuring ethical usage and peace of mind.

    Get Started with Success.ai Today: Let Success.ai transform your marketing and advertising strategies with our comprehensive and reliable EU company data. Contact us to discover how our tailored solutions can help you achieve your business goals and maintain a competitive edge.

    And no one beats us on price. Period.

  3. A Stochastic Individual-Based Model of the Progression of Atrial...

    • figshare.com
    pdf
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eugene T. Y. Chang; Yen Ting Lin; Tobias Galla; Richard H. Clayton; Julie Eatock (2023). A Stochastic Individual-Based Model of the Progression of Atrial Fibrillation in Individuals and Populations [Dataset]. http://doi.org/10.1371/journal.pone.0152349
    Explore at:
    pdfAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Eugene T. Y. Chang; Yen Ting Lin; Tobias Galla; Richard H. Clayton; Julie Eatock
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Models that represent the mechanisms that initiate and sustain atrial fibrillation (AF) in the heart are computationally expensive to simulate and therefore only capture short time scales of a few heart beats. It is therefore difficult to embed biophysical mechanisms into both policy-level disease models, which consider populations of patients over multiple decades, and guidelines that recommend treatment strategies for patients. The aim of this study is to link these modelling paradigms using a stylised population-level model that both represents AF progression over a long time-scale and retains a description of biophysical mechanisms. We develop a non-Markovian binary switching model incorporating three different aspects of AF progression: genetic disposition, disease/age related remodelling, and AF-related remodelling. This approach allows us to simulate individual AF episodes as well as the natural progression of AF in patients over a period of decades. Model parameters are derived, where possible, from the literature, and the model development has highlighted a need for quantitative data that describe the progression of AF in population of patients. The model produces time series data of AF episodes over the lifetimes of simulated patients. These are analysed to quantitatively describe progression of AF in terms of several underlying parameters. Overall, the model has potential to link mechanisms of AF to progression, and to be used as a tool to study clinical markers of AF or as training data for AF classification algorithms.

  4. EMA 65 Crossover

    • kaggle.com
    zip
    Updated May 9, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    daytrader.ai (2018). EMA 65 Crossover [Dataset]. https://www.kaggle.com/daytrader/ema-65-crossover
    Explore at:
    zip(936739600 bytes)Available download formats
    Dataset updated
    May 9, 2018
    Dataset provided by
    daytrader.ai
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    Day traders identify patterns in the market that tell them when to enter and exit a trade. They never hold market position over night meaning profit targets for trades are expressed in minutes not days. To further back their technical analysis they view the data in multiple time domains trying to locate so called "support" and "resistance" levels. Many of the technical indicators are based on price alone.

    This data set should allow a machine algorithm to form better technical indicators then a human, thus allowing it to predict probabilities for entry conditions better then a human day trader.

    For this data set we analysed 7 years of NASDAQ100 data (from 2010 to mid 2017). Every morning we wait until 90 minutes of trading history has occurred before scanning for a simple pattern. This pattern is when the 15 minute EMA crosses over the 65 minute EMA.

    Symbols included in the search:

    • FB - Facebook
    • BABA - Alibaba
    • GOOG - Google class C
    • AAPL - Apple
    • TSLA - Tesla
    • MSFT - Microsoft
    • NVDA - NVidia
    • AMZN - Amazon
    • CRM - Salesforce
    • GOOGL - Google class A
    • ADBE - Adobe
    • NFLX - Netflix
    • INTC - Intel
    • BIDU - Baidu

    Content

    Once the pattern has been detected I give you 2400 minute (40 hours) of previous history. As well as 20 minutes of future history.
    The data will be formatted as follows.

    File name: data_**N**_**SYM**.csv

    • N is an incremental integer
    • SYM is the stock symbol ticker (FB, BABA, ect..)

    Inside each of the csv files you will find 2420 lines of comma separated values, with format:

    ISO formatted date, closing price, volume.

    Eg:

    2017-10-17T14:18:00.000Z,201.87,55800.0
    2017-10-17T14:19:00.000Z,201.21,137786.0
    2017-10-17T14:20:00.000Z,201.852,103695.0
    2017-10-17T14:21:00.000Z,201.6,81362.0
    2017-10-17T14:22:00.000Z,201.54,30183.0
    2017-10-17T14:23:00.000Z,201.43,72405.0
    2017-10-17T14:24:00.000Z,201.15,79411.0
    2017-10-17T14:25:00.000Z,201.48,125713.0
    

    The task should report a probability that this will be a successful trade or not.

    Further note: One should keep in mind that there are trading fees involved for the entry and exit of the trade. So in order to profile you will need to beat this spread.

    Acknowledgements

    Further ideas and questions can be directed to http://daytrader.ai Thanks and I hope you have some fun with this set :) blog: https://medium.com/@coreyauger/daytrader-ai-machine-learning-applied-to-intraday-trading-a6b4e44b0274

    Inspiration

    The task should try to predict the label with the lowest possible error Useful links:

  5. 🎶 10k Song Dataset

    • kaggle.com
    zip
    Updated Aug 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    mexwell (2023). 🎶 10k Song Dataset [Dataset]. https://www.kaggle.com/datasets/mexwell/10k-song-dataset/code
    Explore at:
    zip(1194420 bytes)Available download formats
    Dataset updated
    Aug 14, 2023
    Authors
    mexwell
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    This library comes from the Million Song Dataset, which used a company called the Echo Nest to derive data points about one million popular contemporary songs. The Million Song Dataset is a collaboration between the Echo Nest and LabROSA, a laboratory working towards intelligent machine listening. The project was also funded in part by the National Science Foundation of America (NSF) to provide a large data set to evaluate research related to algorithms on a commercial size while promoting further research into the Music Information Retrieval field. The data contains standard information about the songs such as artist name, title, and year released. Additionally, the data contains more advanced information; for example, the length of the song, how many musical bars long the song is, and how long the fade in to the song was.

    Data Dictionary

    ...

    KeyList of...CommentExample Value
    artist.familiarityFloatA measure of 0..1 for how familiar the artist is to listeners.0.581793766
    artist.hotttnesssFloatA measure of the artists's popularity, when downloaded (in December 2010). Measured on a scale of 0 to 1.0.401997543
    artist.idStringA unique ID for this artist."ARD7TVE1187B99BFB1"
    artist.latitudeFloatThe home location's latitude of this artist.0.0
    artist.locationIntegerUnknown.0
    artist.longitudeFloatThe home location's longitude of this artist.0.0
    artist.nameStringThe name of the artist."Casual"
    artist.similarFloatUnknown.0.0
    artist.termsStringThe term most associated with this artist."hip hop"
    artist.terms_freqFloatThe frequency of this term.1.0
    release.idIntegerThe ID of the release (album) on the service 7digital.com300848
    release.nameIntegerUnknown value0
    song.artist_mbtagsFloatUnknown field.0.0
    song.artist_mbtags_countFloatNumber of tags for the artist on mbtags.0.0
    song.bars_confidenceFloatConfidence value (between 0 and 1) associated with each bar.0.643
    song.bars_startFloatAverage start time of each bar, measured in bars.0.58521
    song.beats_confidenceFloatAverage confidence interval of the beats.0.834
    song.beats_startFloatAverage start time of each beat, measured in beats.0.58521
    song.durationFloatDuration of the track in seconds.218.93179
    song.end_of_fade_inFloatTime of the end of the fade in, at the beginning of the song.0.247
    song.hotttnesssFloatA measure of the song's popularity, when downloaded (in December 2010). Measured on a scale of 0 to 1.0.60211999
    song.idStringA uniquely identifying number for the song."SOMZWCG12A8C13C480"
    song.keyFloatEstimation of the key the song is in. Keys can be from 0 to 11.1.0
    song.key_confidenceFloatConfidence value (between 0 and 1) of the key estimation.0.736
    song.loudnessFloatGeneral loudness of the track-11.197
    song.modeIntegerEstimation of the mode the song.
  6. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sebastian, Rafael; Camara, Oscar; Mont, Lluis; Penela, Diego; Doste, Ruben; Lozano, Miguel; Berruezo, Antonio; Jimenez-Perez, Guillermo (2022). DataSheet1_Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias.PDF [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000449397

DataSheet1_Training machine learning models with synthetic data improves the prediction of ventricular origin in outflow tract ventricular arrhythmias.PDF

Explore at:
Dataset updated
Aug 12, 2022
Authors
Sebastian, Rafael; Camara, Oscar; Mont, Lluis; Penela, Diego; Doste, Ruben; Lozano, Miguel; Berruezo, Antonio; Jimenez-Perez, Guillermo
Description

In order to determine the site of origin (SOO) in outflow tract ventricular arrhythmias (OTVAs) before an ablation procedure, several algorithms based on manual identification of electrocardiogram (ECG) features, have been developed. However, the reported accuracy decreases when tested with different datasets. Machine learning algorithms can automatize the process and improve generalization, but their performance is hampered by the lack of large enough OTVA databases. We propose the use of detailed electrophysiological simulations of OTVAs to train a machine learning classification model to predict the ventricular origin of the SOO of ectopic beats. We generated a synthetic database of 12-lead ECGs (2,496 signals) by running multiple simulations from the most typical OTVA SOO in 16 patient-specific geometries. Two types of input data were considered in the classification, raw and feature ECG signals. From the simulated raw 12-lead ECG, we analyzed the contribution of each lead in the predictions, keeping the best ones for the training process. For feature-based analysis, we used entropy-based methods to rank the obtained features. A cross-validation process was included to evaluate the machine learning model. Following, two clinical OTVA databases from different hospitals, including ECGs from 365 patients, were used as test-sets to assess the generalization of the proposed approach. The results show that V2 was the best lead for classification. Prediction of the SOO in OTVA, using both raw signals or features for classification, presented high accuracy values (>0.96). Generalization of the network trained on simulated data was good for both patient datasets (accuracy of 0.86 and 0.84, respectively) and presented better values than using exclusively real ECGs for classification (accuracy of 0.84 and 0.76 for each dataset). The use of simulated ECG data for training machine learning-based classification algorithms is critical to obtain good SOO predictions in OTVA compared to real data alone. The fast implementation and generalization of the proposed methodology may contribute towards its application to a clinical routine.

Search
Clear search
Close search
Google apps
Main menu