62 datasets found
  1. Market Basket Analysis

    • kaggle.com
    Updated Dec 9, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aslan Ahmedov (2021). Market Basket Analysis [Dataset]. https://www.kaggle.com/datasets/aslanahmedov/market-basket-analysis
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 9, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aslan Ahmedov
    Description

    Market Basket Analysis

    Market basket analysis with Apriori algorithm

    The retailer wants to target customers with suggestions on itemset that a customer is most likely to purchase .I was given dataset contains data of a retailer; the transaction data provides data around all the transactions that have happened over a period of time. Retailer will use result to grove in his industry and provide for customer suggestions on itemset, we be able increase customer engagement and improve customer experience and identify customer behavior. I will solve this problem with use Association Rules type of unsupervised learning technique that checks for the dependency of one data item on another data item.

    Introduction

    Association Rule is most used when you are planning to build association in different objects in a set. It works when you are planning to find frequent patterns in a transaction database. It can tell you what items do customers frequently buy together and it allows retailer to identify relationships between the items.

    An Example of Association Rules

    Assume there are 100 customers, 10 of them bought Computer Mouth, 9 bought Mat for Mouse and 8 bought both of them. - bought Computer Mouth => bought Mat for Mouse - support = P(Mouth & Mat) = 8/100 = 0.08 - confidence = support/P(Mat for Mouse) = 0.08/0.09 = 0.89 - lift = confidence/P(Computer Mouth) = 0.89/0.10 = 8.9 This just simple example. In practice, a rule needs the support of several hundred transactions, before it can be considered statistically significant, and datasets often contain thousands or millions of transactions.

    Strategy

    • Data Import
    • Data Understanding and Exploration
    • Transformation of the data – so that is ready to be consumed by the association rules algorithm
    • Running association rules
    • Exploring the rules generated
    • Filtering the generated rules
    • Visualization of Rule

    Dataset Description

    • File name: Assignment-1_Data
    • List name: retaildata
    • File format: . xlsx
    • Number of Row: 522065
    • Number of Attributes: 7

      • BillNo: 6-digit number assigned to each transaction. Nominal.
      • Itemname: Product name. Nominal.
      • Quantity: The quantities of each product per transaction. Numeric.
      • Date: The day and time when each transaction was generated. Numeric.
      • Price: Product price. Numeric.
      • CustomerID: 5-digit number assigned to each customer. Nominal.
      • Country: Name of the country where each customer resides. Nominal.

    imagehttps://user-images.githubusercontent.com/91852182/145270162-fc53e5a3-4ad1-4d06-b0e0-228aabcf6b70.png">

    Libraries in R

    First, we need to load required libraries. Shortly I describe all libraries.

    • arules - Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
    • arulesViz - Extends package 'arules' with various visualization. techniques for association rules and item-sets. The package also includes several interactive visualizations for rule exploration.
    • tidyverse - The tidyverse is an opinionated collection of R packages designed for data science.
    • readxl - Read Excel Files in R.
    • plyr - Tools for Splitting, Applying and Combining Data.
    • ggplot2 - A system for 'declaratively' creating graphics, based on "The Grammar of Graphics". You provide the data, tell 'ggplot2' how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details.
    • knitr - Dynamic Report generation in R.
    • magrittr- Provides a mechanism for chaining commands with a new forward-pipe operator, %>%. This operator will forward a value, or the result of an expression, into the next function call/expression. There is flexible support for the type of right-hand side expressions.
    • dplyr - A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
    • tidyverse - This package is designed to make it easy to install and load multiple 'tidyverse' packages in a single step.

    imagehttps://user-images.githubusercontent.com/91852182/145270210-49c8e1aa-9753-431b-a8d5-99601bc76cb5.png">

    Data Pre-processing

    Next, we need to upload Assignment-1_Data. xlsx to R to read the dataset.Now we can see our data in R.

    imagehttps://user-images.githubusercontent.com/91852182/145270229-514f0983-3bbb-4cd3-be64-980e92656a02.png"> imagehttps://user-images.githubusercontent.com/91852182/145270251-6f6f6472-8817-435c-a995-9bc4bfef10d1.png">

    After we will clear our data frame, will remove missing values.

    imagehttps://user-images.githubusercontent.com/91852182/145270286-05854e1a-2b6c-490e-ab30-9e99e731eacb.png">

    To apply Association Rule mining, we need to convert dataframe into transaction data to make all items that are bought together in one invoice will be in ...

  2. Retail Transactions Dataset

    • kaggle.com
    Updated May 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prasad Patil (2024). Retail Transactions Dataset [Dataset]. https://www.kaggle.com/datasets/prasad22/retail-transactions-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 18, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Prasad Patil
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset was created to simulate a market basket dataset, providing insights into customer purchasing behavior and store operations. The dataset facilitates market basket analysis, customer segmentation, and other retail analytics tasks. Here's more information about the context and inspiration behind this dataset:

    Context:

    Retail businesses, from supermarkets to convenience stores, are constantly seeking ways to better understand their customers and improve their operations. Market basket analysis, a technique used in retail analytics, explores customer purchase patterns to uncover associations between products, identify trends, and optimize pricing and promotions. Customer segmentation allows businesses to tailor their offerings to specific groups, enhancing the customer experience.

    Inspiration:

    The inspiration for this dataset comes from the need for accessible and customizable market basket datasets. While real-world retail data is sensitive and often restricted, synthetic datasets offer a safe and versatile alternative. Researchers, data scientists, and analysts can use this dataset to develop and test algorithms, models, and analytical tools.

    Dataset Information:

    The columns provide information about the transactions, customers, products, and purchasing behavior, making the dataset suitable for various analyses, including market basket analysis and customer segmentation. Here's a brief explanation of each column in the Dataset:

    • Transaction_ID: A unique identifier for each transaction, represented as a 10-digit number. This column is used to uniquely identify each purchase.
    • Date: The date and time when the transaction occurred. It records the timestamp of each purchase.
    • Customer_Name: The name of the customer who made the purchase. It provides information about the customer's identity.
    • Product: A list of products purchased in the transaction. It includes the names of the products bought.
    • Total_Items: The total number of items purchased in the transaction. It represents the quantity of products bought.
    • Total_Cost: The total cost of the purchase, in currency. It represents the financial value of the transaction.
    • Payment_Method: The method used for payment in the transaction, such as credit card, debit card, cash, or mobile payment.
    • City: The city where the purchase took place. It indicates the location of the transaction.
    • Store_Type: The type of store where the purchase was made, such as a supermarket, convenience store, department store, etc.
    • Discount_Applied: A binary indicator (True/False) representing whether a discount was applied to the transaction.
    • Customer_Category: A category representing the customer's background or age group.
    • Season: The season in which the purchase occurred, such as spring, summer, fall, or winter.
    • Promotion: The type of promotion applied to the transaction, such as "None," "BOGO (Buy One Get One)," or "Discount on Selected Items."

    Use Cases:

    • Market Basket Analysis: Discover associations between products and uncover buying patterns.
    • Customer Segmentation: Group customers based on purchasing behavior.
    • Pricing Optimization: Optimize pricing strategies and identify opportunities for discounts and promotions.
    • Retail Analytics: Analyze store performance and customer trends.

    Note: This dataset is entirely synthetic and was generated using the Python Faker library, which means it doesn't contain real customer data. It's designed for educational and research purposes.

  3. Z

    Label-Free Detection Market - By Technology (Mass Spectrometry, Bio-Layer...

    • zionmarketresearch.com
    pdf
    Updated Jul 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zion Market Research (2025). Label-Free Detection Market - By Technology (Mass Spectrometry, Bio-Layer Interferometry, Surface Plasmon Resonance, Microplates, and Calorimetry), By Product Type (Instruments and Consumables), By End-User (CROs, Pharma & Biotech Firms, and Academic & Research Institutions), By Application (Binding Kinetics, Concentration Analysis, Affinity Analysis, and End-Point Screening & Specificity Testing), and By Region - Global and Regional Industry Overview, Comprehensive Analysis, Historical Data, and Forecasts 2024-2032 [Dataset]. https://www.zionmarketresearch.com/report/label-free-detection-market
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 21, 2025
    Dataset authored and provided by
    Zion Market Research
    License

    https://www.zionmarketresearch.com/privacy-policyhttps://www.zionmarketresearch.com/privacy-policy

    Time period covered
    2022 - 2030
    Area covered
    Global
    Description

    Global Label-Free Detection Market market size valued at US$ 14.13 Billion in 2023, set to reach US$ 41.76 Billion by 2032 at a CAGR of about 12.8% from 2024 to 2032.

  4. m

    Boronate Affinity Chromatography Market Size, Share & Industry Analysis 2033...

    • marketresearchintellect.com
    Updated Jan 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). Boronate Affinity Chromatography Market Size, Share & Industry Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/boronate-affinity-chromatography-market/
    Explore at:
    Dataset updated
    Jan 29, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Discover the latest insights from Market Research Intellect's report_name, valued at current_value in 2024, with significant growth projected to forecast_value by 2033 at a CAGR of cagr_value (2026-2033).

  5. d

    pass_by Psychographic Data | USA | 93% retail coverage

    • datarade.ai
    .json, .csv
    Updated Jul 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    pass_by (2024). pass_by Psychographic Data | USA | 93% retail coverage [Dataset]. https://datarade.ai/data-products/pass-by-psychographic-data-usa-93-retail-coverage-pass-by
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Jul 1, 2024
    Dataset authored and provided by
    pass_by
    Area covered
    United States
    Description

    This product provides a monthly breakdown of the shopper profile for individual Points of Interest (POI), offering invaluable insight into the characteristics of who is visiting that location each month. It includes aggregated psychographic and demographic attributes such as age, gender, income level, lifestyle segments, and other key behavioral indicators. Furthermore, it surfaces the distribution of home ZIP codes, illustrating the geographic origins of visitors, and highlights other brands and POIs those same visitors also frequent during the month, revealing broader consumer behavior.

    All metrics are consistently expressed as a percentage share of total visits to the POI in that month. This standardized approach allows for robust month-over-month comparison and precise audience trend analysis. Users can therefore comprehensively understand how the composition of shoppers is changing over time, where they live, what defines their consumer preferences, and how they behave across the wider retail landscape.

    The data is fully anonymized and aggregated, with no access to individual-level or device-level records. It is delivered monthly and is commonly utilized for in-depth audience profiling, strategic market segmentation, powerful brand affinity analysis, and informed strategic decision-making

  6. Customer360Insights

    • kaggle.com
    Updated Jun 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dave Darshan (2024). Customer360Insights [Dataset]. https://www.kaggle.com/datasets/davedarshan/customer360insights
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 9, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dave Darshan
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Customer360Insights

    The Customer360Insights dataset is a synthetic collection meticulously designed to mirror the multifaceted nature of customer interactions within an e-commerce platform. It encompasses a wide array of variables, each serving as a pillar to support various analytical explorations. Here’s a breakdown of the dataset and the potential analyses it enables:

    Dataset Description

    • Customer Demographics: Includes FullName, Gender, Age, CreditScore, and MonthlyIncome. These variables provide a demographic snapshot of the customer base, allowing for segmentation and targeted marketing analysis.
    • Geographical Data: Comprising Country, State, and City, this section facilitates location-based analytics, market penetration studies, and regional sales performance.
    • Product Information: Details like Category, Product, Cost, and Price enable product trend analysis, profitability assessment, and inventory optimization.
    • Transactional Data: Captures the customer journey through SessionStart, CartAdditionTime, OrderConfirmation, OrderConfirmationTime, PaymentMethod, and SessionEnd. This rich temporal data can be used for funnel analysis, conversion rate optimization, and customer behavior modeling.
    • Post-Purchase Details: With OrderReturn and ReturnReason, analysts can delve into return rate calculations, post-purchase satisfaction, and quality control.

    Types of Analysis

    • Descriptive Analytics: Understand basic metrics like average monthly income, most common product categories, and typical credit scores.
    • Predictive Analytics: Use machine learning to predict credit risk or the likelihood of a purchase based on demographics and session activity.
    • Customer Segmentation: Group customers by demographics or purchasing behavior to tailor marketing strategies.
    • Geospatial Analysis: Examine sales distribution across different regions and optimize logistics. Time Series Analysis: Study the seasonality of purchases and session activities over time.
    • Funnel Analysis: Evaluate the customer journey from session start to order confirmation and identify drop-off points.
    • Cohort Analysis: Track customer cohorts over time to understand retention and repeat purchase patterns.
    • Market Basket Analysis: Discover product affinities and develop cross-selling strategies.

    This dataset is a playground for data enthusiasts to practice cleaning, transforming, visualizing, and modeling data. Whether you’re conducting A/B testing for marketing campaigns, forecasting sales, or building customer profiles, Customer360Insights offers a rich, realistic dataset for honing your data science skills.

    Curious about how I created the data? Feel free to click here and take a peek! 😉

    📊🔍 Good Luck and Happy Analysing 🔍📊

  7. m

    Affinity Resins Market Size, Share & Industry Analysis 2033

    • marketresearchintellect.com
    Updated Jul 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). Affinity Resins Market Size, Share & Industry Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/global-affinity-resins-market/
    Explore at:
    Dataset updated
    Jul 18, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Discover the latest insights from Market Research Intellect's Affinity Resins Market Report, valued at USD 2.5 billion in 2024, with significant growth projected to USD 4.1 billion by 2033 at a CAGR of 6.2% (2026-2033).

  8. Affinity Chromatography Reagents Market Analysis North America, Europe,...

    • technavio.com
    Updated Aug 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Technavio (2024). Affinity Chromatography Reagents Market Analysis North America, Europe, Asia, Rest of World (ROW) - US, Germany, UK, China, Japan - Size and Forecast 2024-2028 [Dataset]. https://www.technavio.com/report/affinity-chromatography-reagents-market-industry-analysis
    Explore at:
    Dataset updated
    Aug 15, 2024
    Dataset provided by
    TechNavio
    Authors
    Technavio
    Time period covered
    2021 - 2025
    Area covered
    Global
    Description

    Snapshot img

    Affinity Chromatography Reagents Market Size 2024-2028

    The affinity chromatography reagents market size is forecast to increase by USD 610.7 million at a CAGR of 10.68% between 2023 and 2028.

    The market is experiencing significant growth due to the increasing research and development in the biopharmaceutical industry. The rising demand for therapeutic antibodies, which are primarily produced using affinity chromatography, is a key driver for market expansion. However, this market is not without challenges. Stringent regulations regarding purity standards, particularly in the pharmaceutical sector, necessitate the use of high-quality affinity chromatography reagents to ensure product safety and efficacy. As a result, companies must invest in research and development to meet these regulatory requirements and maintain a competitive edge. Additionally, the market is witnessing a trend towards the use of alternative chromatography techniques, such as protein A and protein G, which may impact the market share of other affinity chromatography reagents. Companies seeking to capitalize on market opportunities and navigate challenges effectively should focus on innovation, regulatory compliance, and cost-effective production methods.

    What will be the Size of the Affinity Chromatography Reagents Market during the forecast period?

    Request Free SampleThe market in the United States is experiencing significant growth due to the increasing demand for effective separation and purification solutions in various industries. This market caters to research activities in life sciences, healthcare infrastructure, and the biotechnology industry, with applications in environmental pollution monitoring, gas chromatography, and clinical diagnostics. The market's sizeable expansion can be attributed to the growing focus on personalized medicine, biopharmaceutical production, and bioprocessing workflows. Key drivers include the isolation of biomolecules, such as proteins, nucleic acids, and biopolymers, as well as the increasing use of specialized training and alternative analytical methods like mass spectrometry and thin layer chromatography. Buffers and ligands play a crucial role in affinity chromatography, ensuring the successful binding and separation of target molecules. The cosmetic industry also contributes to the market's growth, with an emphasis on quality control and purification in bioprocessing and biotechnology. Overall, the market is poised for continued expansion, driven by the diverse needs of research, healthcare, and industrial applications.

    How is this Affinity Chromatography Reagents Industry segmented?

    The affinity chromatography reagents industry research report provides comprehensive data (region-wise segment analysis), with forecasts and estimates in 'USD million' for the period 2024-2028, as well as historical data from 2018-2022 for the following segments. End-userPharmaceutical and biotechnology industryFood and beverage industryCosmetic industryOthersApplicationAnalytical chromatography reagentsPreparative chromatography reagentsGeographyNorth AmericaUSEuropeGermanyUKAsiaChinaJapanRest of World (ROW)

    By End-user Insights

    The pharmaceutical and biotechnology industry segment is estimated to witness significant growth during the forecast period.Affinity chromatography reagents play a crucial role in various industries, including drug discovery and healthcare, due to their ability to purify complex biomolecules such as proteins, nucleic acids, enzymes, and monoclonal antibodies. This technology is particularly valuable in food safety applications, where it is used to isolate and detect contaminants like toxins, pesticides, and heavy metals. In the field of biomedical research, affinity chromatography is employed for the separation and purification of target molecules from environmental samples, ensuring regulatory compliance and product standardization. Moreover, the aging population and the rise of chronic diseases have fueled the demand for high-purity biopharmaceuticals and personalized medicine, leading to an increased focus on bioprocessing and separation techniques. Affinity chromatography reagents, including buffers, resins, ligands, and magnetic beads, are essential components of these workflows. Collaborative research initiatives and academic institutions are also driving the adoption of affinity chromatography reagents in life sciences research, as they offer customizable solutions for a wide range of applications. Analytical instruments, such as mass spectrometry and gas chromatography, are often employed in conjunction with affinity chromatography for the identification and quantification of biomolecules. The cosmetic industry also relies on affinity chromatography for the purification of enzymes and proteins used in various cosmetic formulations. With the growing emphasis on quality and safety, the need for affinity chr

  9. i

    Protein A Affinity Resin Market - Latest Advancement And Analysis

    • imrmarketreports.com
    Updated Nov 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Swati Kalagate; Akshay Patil; Vishal Kumbhar (2023). Protein A Affinity Resin Market - Latest Advancement And Analysis [Dataset]. https://www.imrmarketreports.com/reports/protein-a-affinity-resin-market
    Explore at:
    Dataset updated
    Nov 2023
    Dataset provided by
    IMR Market Reports
    Authors
    Swati Kalagate; Akshay Patil; Vishal Kumbhar
    License

    https://www.imrmarketreports.com/privacy-policy/https://www.imrmarketreports.com/privacy-policy/

    Description

    The report offers Protein A Affinity Resin Market Dynamics, Comprises Industry development drivers, challenges, opportunities, threats and limitations. A report also incorporates Cost Trend of products, Mergers & Acquisitions, Expansion, Crucial Suppliers of products, Concentration Rate of Steel Coupling Economy. Global Protein A Affinity Resin Market Research Report covers Market Effect Factors investigation chiefly included Technology Progress, Consumer Requires Trend, External Environmental Change.

  10. Analysis Products: Transcription factor stoichiometry, motif affinity and...

    • zenodo.org
    tsv, zip
    Updated Nov 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Surag Nair; Surag Nair; Mohamed Ameen; Kevin Wang; Kevin Wang; Anshul Kundaje; Anshul Kundaje; Mohamed Ameen (2023). Analysis Products: Transcription factor stoichiometry, motif affinity and syntax regulate single cell chromatin dynamics during fibroblast reprogramming to pluripotency [Dataset]. http://doi.org/10.5281/zenodo.8313962
    Explore at:
    zip, tsvAvailable download formats
    Dataset updated
    Nov 11, 2023
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Surag Nair; Surag Nair; Mohamed Ameen; Kevin Wang; Kevin Wang; Anshul Kundaje; Anshul Kundaje; Mohamed Ameen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This record contains analysis products for the paper "Transcription factor stoichiometry, motif affinity and syntax regulate single cell chromatin dynamics during fibroblast reprogramming to pluripotency" by Nair, Ameen et al. Please refer to the READMEs in the directories, which are summarized below.

    The record contains the following files:

    `clusters.tsv`: contains the cluster id, name and colour of clusters in the paper

    scATAC.zip

    Analysis products for the single-cell ATAC-seq data. Contains:

    - `cells.tsv`: list of barcodes that pass QC. Columns include:
    - `barcode`
    - `sample`: (time point)
    - `umap1`
    - `umap2`
    - `cluster`
    - `dpt_pseudotime_fibr_root`: pseudotime values treating a fibroblast cell as root
    - `dpt_pseudotime_xOSK_root`: pseudotime values treating xOSK cell as root
    - `peaks.bed`: list of peaks of 500bp across all cell states. 4th column contains the peak set label. Note that ~5000 peaks are not assigned to any peak set and are marked as NA.
    - `features.tsv`: 50 dimensional representation of each cell
    - `cell_x_peak.mtx.gz`: sparse matrix of fragment counts within peaks. Load using scipy.io.mmread in python or readMM in R. Columns correspond to cells from `cells.tsv` (combine sample + barcode). Rows correspond to peaks in `peaks.bed`

    scATAC_clusters.zip

    Analysis products corresponding to cluster pseudo-bulks of the single-cell ATAC-seq data.

    - `clusters.tsv`: contains the cluster id, name and colour used in the paper
    - `peaks`: contains `overlap_reproducibilty/overlap.optimal_peak` peaks called using ENCODE bulk ATAC-seq pipeline in the narrowPeak format.
    - `fragments`: contains per cluster fragment files

    scATAC_scRNA_integration.zip

    Analysis products from the integration of scATAC with scRNA. Contains:

    - `peak_gene_links_fdr1e-4.tsv`: file with peak gene links passing FDR 1e-4. For analyses in the paper, we filter to peaks with absolute correlation >0.45.
    - `harmony.cca.30.feat.tsv`: 30 dimensional co-embedding for scATAC and scRNA cells obtained by CCA followed by applying Harmony over assay type.
    - `harmony.cca.metadata.tsv`: UMAP coordinates for scATAC and scRNA cells derived from the Harmony CCA embedding. First column contains barcode.

    scRNA.zip

    Analysis products for the single-cell RNA-seq data. Contains:

    - `seurat.rds`: seurat object that contains expression data (raw counts, normalized, and scaled), reductions (umap, pca), knn graphs, all associated metadata. Note that barcode suffix (1-9 corresponds to samples D0, D2, ..., D14, iPSC)
    - `genes.txt`: list of all genes
    - `cells.tsv`: list of barcodes that pass QC across samples. Contains:
    - `barcode_sample`: barcode with index of sample (1-9 corresponding to D0, D2, ..., D14, iPSC)
    - `sample`: sample name (D0, D2, .., D14, iPSC)
    - `umap1`
    - `umap2`
    - `nCount_RNA`
    - `nFeature_RNA`
    - `cluster`
    - `percent.mt`: percent of mitochondrial transcripts in cell
    - `percent.oskm`: percent of OSKM transcripts in cell
    - `gene_x_cell.mtx.gz`: sparse matrix of gene counts. Load using scipy.io.mmread in python or readMM in R. Columns correspond to cells from `cells.tsv` (barcode suffix contains sample information). Rows correspond to genes in `genes.txt`
    - `pca.tsv`: first 50 PC of each cell
    - `oskm_endo_sendai.tsv`: estimated raw counts (cts, may not be integers) and log(1+ tp10k) normalized expression (norm) for endogenous and exogenous (Sendai derived) counts of POU5F1 (OCT4), SOX2, KLF4 and MYC genes. Rows are consistent with `seurat.rds` and `cells.tsv`

    multiome.zip

    multiome/snATAC:

    These files are derived from the integration of nuclei from multiome (D1M and D2M), with cells from day 2 of scATAC-seq (labeled D2).

    - `cells.tsv`: This is the list of nuclei barcodes that pass QC from multiome AND also cell barcodes from D2 of scATAC-seq. Includes:
    - `barcode`
    - `umap1`: These are the coordinates used for the figures involving multiome in the paper.
    - `umap2`: ^^^
    - `sample`: D1M and D2M correspond to multiome, D2 corresponds to day 2 of scATAC-seq
    - `cluster`: For multiome barcodes, these are labels transfered from scATAC-seq. For D2 scATAC-seq, it is the original cluster labels.
    - `peaks.bed`: This is the same file as scATAC/peaks.bed. List of peaks of 500bp. 4th column contains the peak set label. Note that ~5000 peaks are not assigned to any peak set and are marked as NA.
    - `cell_x_peak.mtx.gz`: sparse matrix of fragment counts within peaks. Load using scipy.io.mmread in python or readMM in R. Columns correspond to cells from `cells.tsv` (combine sample + barcode). Rows correspond to peaks in `peaks.bed`.
    - `features.no.harmony.50d.tsv`: 50 dimensional representation of each cell prior to running Harmony (to correct for batch effect between D2 scATAC and D1M,D2M snMultiome). Rows correspond to cells from `cells.tsv`.
    - `features.harmony.10d.tsv`: 10 dimensional representation of each cell after running Harmony. Rows correspond to cells from `cells.tsv`.

    multiome/snRNA:

    - `seurat.rds`: seurat object that contains expression data (raw counts, normalized, and scaled), reductions (umap, pca),associated metadata. Note that barcode suffix (1,2 corresponds to samples D1M, D2M). Please use the UMAP/features from snATAC/ for consistency.
    - `genes.txt`: list of all genes (this is different from the list in scRNA analysis)
    - `cells.tsv`: list of barcodes that pass QC across samples. Contains:
    - `barcode_sample`: barcode with index of sample (1,2 corresponding to D1M, D2M respectively)
    - `sample`: sample name (D1M, D2M)
    - `nCount_RNA`
    - `nFeature_RNA`
    - `percent.oskm`: percent of OSKM genes in cell
    - `gene_x_cell.mtx.gz`: sparse matrix of gene counts. Load using scipy.io.mmread in python or readMM in R. Columns correspond to cells from `cells.tsv` (barcode suffix contains sample information). Rows correspond to genes in `genes.txt`

  11. m

    Affinity Chromatography Reagents Market Industry Size, Share & Growth...

    • marketresearchintellect.com
    Updated Jul 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). Affinity Chromatography Reagents Market Industry Size, Share & Growth Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/global-affinity-chromatography-reagents-market/
    Explore at:
    Dataset updated
    Jul 6, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Explore Market Research Intellect's Affinity Chromatography Reagents Market Report, valued at USD 1.8 billion in 2024, with a projected market growth to USD 3.2 billion by 2033, and a CAGR of 8.3% from 2026 to 2033.

  12. D

    Affinity Chromatography Media Market Report | Global Forecast From 2025 To...

    • dataintelo.com
    csv, pdf, pptx
    Updated Jan 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Affinity Chromatography Media Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/affinity-chromatography-media-market
    Explore at:
    pdf, csv, pptxAvailable download formats
    Dataset updated
    Jan 7, 2025
    Authors
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Affinity Chromatography Media Market Outlook



    The global affinity chromatography media market size was valued at approximately USD 985 million in 2023 and is projected to reach around USD 1.65 billion by 2032, growing at a CAGR of 5.8% during the forecast period. This significant growth is driven by the increasing demand for advanced purification techniques in the pharmaceutical and biotechnology sectors, among other factors.



    The primary growth factor fueling the affinity chromatography media market is the expanding pharmaceutical and biotechnology industry. As companies in these sectors increasingly engage in the development and production of novel therapeutics, the need for efficient and effective purification techniques is paramount. Affinity chromatography media provides high specificity and yield in the purification processes, making it a preferred choice. Additionally, advancements in biotechnology and the rising prevalence of chronic diseases are pushing the industry towards more sophisticated drug development processes, further driving the demand for affinity chromatography media.



    Another critical factor contributing to the market's growth is the substantial investment in research and development activities. Both private and public sectors are investing heavily in life sciences research, which includes proteomics and genomics studies. Such research often requires high-purity samples, necessitating the use of advanced purification techniques like affinity chromatography. The increasing focus on personalized medicine and biopharmaceuticals is also propelling the demand for high-quality affinity chromatography media to ensure the efficacy and safety of these novel treatments. Moreover, the shift towards biologics and biosimilars in drug development is creating additional opportunities for market expansion.



    Technological advancements in chromatography techniques are also playing a significant role in market growth. Innovations such as the development of new affinity ligands, better matrix materials, and enhanced column technologies are improving the efficiency and specificity of affinity chromatography. These advancements are helping to overcome some of the limitations associated with traditional purification methods, making affinity chromatography an increasingly attractive option for a range of applications. Furthermore, the integration of automated systems in chromatography processes is streamlining operations and reducing human error, thus enhancing the overall reliability and efficiency of purification processes.



    In the realm of advanced purification techniques, the AKTA Chromatography Purification Instrument stands out as a pivotal tool in the biotechnology and pharmaceutical industries. This sophisticated instrument is renowned for its precision and efficiency in the separation and purification of biomolecules, making it an essential component in the development of biopharmaceuticals. Its ability to handle complex purification processes with high reproducibility and accuracy is particularly beneficial in research and production settings where consistency is crucial. The AKTA system's versatility allows it to be used in various applications, including protein purification, antibody purification, and enzyme purification, thereby supporting the growing demand for high-quality bioproducts. As the industry continues to evolve, the AKTA Chromatography Purification Instrument remains at the forefront, driving innovation and enhancing the capabilities of researchers and manufacturers alike.



    From a regional perspective, North America holds a significant share of the affinity chromatography media market, driven by the presence of a large number of pharmaceutical and biotechnology companies, as well as substantial investment in life sciences research. Europe follows closely, with significant contributions from countries like Germany, the UK, and France. The Asia Pacific region is expected to witness the highest growth rate during the forecast period, fueled by increasing R&D activities, expanding biopharmaceutical sector, and growing government support for biotechnology research. Latin America and the Middle East & Africa are also emerging markets, showing potential growth due to rising healthcare investments and development of pharmaceutical industries.



    Product Type Analysis



    The affinity chromatography media market is segmented by product type into agarose-based, dextran-based, synthetic polymer-bas

  13. D

    Protein A Affinity Resin Market Report | Global Forecast From 2025 To 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Oct 5, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2024). Protein A Affinity Resin Market Report | Global Forecast From 2025 To 2033 [Dataset]. https://dataintelo.com/report/protein-a-affinity-resin-market
    Explore at:
    csv, pdf, pptxAvailable download formats
    Dataset updated
    Oct 5, 2024
    Authors
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Protein A Affinity Resin Market Outlook



    The global Protein A Affinity Resin market size is projected to witness substantial growth, escalating from $1.2 billion in 2023 to an estimated $2.8 billion by 2032, expanding at a robust CAGR of 9.6% over the forecast period. This growth is majorly driven by the increasing demand for monoclonal antibodies and the growing biopharmaceutical industry.



    One of the primary growth factors for the Protein A Affinity Resin market is the surge in biopharmaceutical manufacturing. As monoclonal antibodies are increasingly used for therapeutic and diagnostic purposes, the need for efficient and high-purity antibody production processes has risen. Protein A affinity resins, known for their specificity and binding efficacy, are essential in the purification steps of monoclonal antibodies. The expanding pipeline of monoclonal antibodies in development and the rise in approvals of biopharmaceutical products further propel the demand for Protein A Affinity Resins.



    Another critical factor contributing to market growth is the advancements in protein engineering and resin development. Innovations in recombinant DNA technology have enabled the production of recombinant Protein A resins with superior properties such as enhanced binding capacity, stability, and reduced leaching. These advancements have improved the performance and lifespan of Protein A resins, making them more cost-effective and efficient for large-scale biopharmaceutical manufacturing. The continuous research and development in this area are expected to offer new growth opportunities for the market.



    Furthermore, increased investment in biotechnology and pharmaceutical research is contributing to the market expansion. Governments and private entities are investing heavily in life sciences and biotechnology research, which fuels the demand for Protein A affinity resins in both industrial and academic settings. The utilization of these resins in various stages of drug development, from clinical research to large-scale manufacturing, underscores their critical role in the biopharmaceutical supply chain. This widespread usage across different research and manufacturing sectors significantly drives market growth.



    Regionally, North America holds a dominant position in the Protein A Affinity Resin market, driven by a well-established biopharmaceutical industry and the presence of major market players. The Asia Pacific region is anticipated to exhibit the highest growth rate over the forecast period, propelled by increasing healthcare investments, expanding pharmaceutical industries, and growing research activities. Europe also represents a significant market due to advanced healthcare infrastructure and substantial R&D spending by pharmaceutical companies. Each of these regions offers unique opportunities and challenges that shape the dynamics of the Protein A Affinity Resin market globally.



    Product Type Analysis



    The Protein A Affinity Resin market, segmented by product type into Natural Protein A Resin and Recombinant Protein A Resin, exhibits varying demand dynamics. Natural Protein A resins, derived from Staphylococcus aureus, are known for their high specificity to the Fc region of antibodies. However, challenges such as batch-to-batch variability and potential immunogenicity issues have spurred the development and increased adoption of Recombinant Protein A Resins. Recombinant variants, engineered for enhanced performance, offer consistent quality, improved binding capacities, and reduced leaching of Protein A, making them more suitable for large-scale, high-purity monoclonal antibody production.



    Natural Protein A Resins continue to be utilized in traditional applications where cost constraints are critical, and the required purity levels are achievable with these resins. They are often preferred in academic research settings and small-scale production where the variability can be effectively managed. However, the increasing complexity and quality standards in biopharmaceutical manufacturing are gradually shifting the preference towards recombinant versions.



    Recombinant Protein A Resins, designed with advanced engineering techniques, provide enhanced stability and performance. They are less prone to degradation and offer higher binding capacities, which are crucial for industrial-scale biopharmaceutical manufacturing. The ability to withstand harsher conditions and multiple purification cycles makes them more cost-effective in the long run, despite their higher initial costs. This segment is expected to witness robust growth, driven by

  14. Z

    Membrane Chromatography Market By Product (consumables and accessories), By...

    • zionmarketresearch.com
    pdf
    Updated Jul 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zion Market Research (2025). Membrane Chromatography Market By Product (consumables and accessories), By Technique (hydrophobic interaction, ion exchange, and affinity membrane chromatography), By Operation (bind-elute membrane and flow-through chromatography), By end-users (academic & research institutes, CROs, and pharmaceutical & biopharmaceutical companies) and By Region: Global Industry Analysis, Size, Share, Growth, Trends, and Forecast 2024-2032 [Dataset]. https://www.zionmarketresearch.com/report/membrane-chromatography-market
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 21, 2025
    Dataset authored and provided by
    Zion Market Research
    License

    https://www.zionmarketresearch.com/privacy-policyhttps://www.zionmarketresearch.com/privacy-policy

    Time period covered
    2022 - 2030
    Area covered
    Global
    Description

    Global membrane chromatography market size earned $269.54 Million in 2023 and is expected to generate $826.63 Million by 2032, growing at 13.26%.

  15. Z

    HbA1c Testing Devices Market By Product Type (Point-of-Care Testing Devices,...

    • zionmarketresearch.com
    pdf
    Updated Jul 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zion Market Research (2025). HbA1c Testing Devices Market By Product Type (Point-of-Care Testing Devices, Laboratory-Based Analyzers, Home Testing Kits, and Continuous Glucose Monitoring Systems), By Technology (Immunoassay, Ion-Exchange HPLC, Enzymatic Assays, and Boronate Affinity), By End-User (Hospitals, Diagnostic Laboratories, Home Care Settings, and Physician Offices), By Application (Diabetes Diagnosis, Diabetes Management, Pre-diabetes Screening, and Research Applications), and By Region - Global and Regional Industry Overview, Market Intelligence, Comprehensive Analysis, Historical Data, and Forecasts 2025 - 2034 [Dataset]. https://www.zionmarketresearch.com/report/hba1c-testing-devices-market
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Jul 9, 2025
    Dataset authored and provided by
    Zion Market Research
    License

    https://www.zionmarketresearch.com/privacy-policyhttps://www.zionmarketresearch.com/privacy-policy

    Time period covered
    2022 - 2030
    Area covered
    Global
    Description

    Global HbA1c testing devices market was valued at $2.34 billion in 2024 and is expected to $4.36 billion by 2034, a CAGR of 6.40% between 2025 and 2034.

  16. t

    Global 2025 - Players, Regions, Product Types, Application & Forecast...

    • theindustrystats.com
    Updated Mar 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Industry Stats Market Research (2025). Global 2025 - Players, Regions, Product Types, Application & Forecast Analysis [Dataset]. https://theindustrystats.com/report/industrial-purification-chromatography-packing-market/23976/
    Explore at:
    Dataset updated
    Mar 21, 2025
    Dataset authored and provided by
    The Industry Stats Market Research
    License

    https://theindustrystats.com/privacy-policy/https://theindustrystats.com/privacy-policy/

    Area covered
    Global
    Description

    Product Market size is rising upward in the past few years And it is estimated that the market will grow significantly in the forecasted period

    ATTRIBUTESDETAILS
    STUDY PERIOD2017-2030
    BASE YEAR2024
    FORECAST PERIOD2025-2030
    HISTORICAL PERIOD2017-2024
    UNITVALUE (USD MILLION)
    KEY COMPANIES PROFILEDSepax-tech, Thermo Fisher Scientific, Agilent, Tosoh, EMD Millipore, Bio-Rad Laboratories Inc., Repligen Corporation, Nano Micro Tech
    SEGMENTS COVEREDBy Product Type - Affinity Chromatography Packing, Composite Chromatography Packing, Ion Exchange Chromatography Packing, Size Exclusion Chromatography Packing, Hydrophobic Chromatography Packing, Polymer Reverse Chromatography Packing, Silica Gel Matrix Chromatography Packing
    By Application - Biotechnology, Pharmaceutical Industry, Food Safety, Environmental Protection Testing, Academic Research, Others
    By Sales Channels - Direct Channel, Distribution Channel
    By Geography - North America, Europe, Asia-Pacific, South America, Middle East and Africa

  17. L

    Luxury Goods Market in France Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jan 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Luxury Goods Market in France Report [Dataset]. https://www.datainsightsmarket.com/reports/luxury-goods-market-in-france-4504
    Explore at:
    doc, pdf, pptAvailable download formats
    Dataset updated
    Jan 3, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global, France
    Variables measured
    Market Size
    Description

    The size of the Luxury Goods Market in France market was valued at USD XX Million in 2023 and is projected to reach USD XXX Million by 2032, with an expected CAGR of 6.47% during the forecast period. Luxury goods refer to high-end products that are characterized by their exclusivity, superior quality, and high price point. These items are often associated with prestige and status, offering not only exceptional craftsmanship but also a unique experience or brand heritage. Luxury goods encompass a wide range of categories, including fashion items like designer clothing, accessories, and footwear; jewelry and watches high-end automobiles; and premium real estate. The luxury market is distinguished by its emphasis on exclusivity and exceptional quality. Materials used are often rare or of the highest grade, and production processes involve meticulous attention to detail. For instance, luxury fashion brands may use fine fabrics and artisanal techniques, while high-end watches often feature intricate mechanical movements and precious metals. This growth is fueled by a combination of elements, including the elite's disposable income, growing discretionary expenditure, expanding urbanization, and a taste for finer things. The rising desire for branded and designer goods among more affluent persons in metropolitan areas is also a contributing factor. Recent developments include: In April 2022, well-known skincare brand Shiseido declared the launch of its new skincare products in the french market., In October 2021, LVMH declared that they have acquired the French-based fragrance brand Officine Universelle Buly 1803 to sell its variety of heritage perfumes globally including in France., In October 2021, the well-known luxury footwear brand Roger Vivier declared the launch of its latest footwear and accessories under its fall '21 collection.. Key drivers for this market are: Inclination Towards Natural and Organic Formulations. Potential restraints include: Presence of Counterfeit Beauty and Personal Care Products. Notable trends are: High Affinity for Luxury Perfumes.

  18. m

    High Affinity Nerve Growth Factor Receptor Sales Market Size, Share & Trends...

    • marketresearchintellect.com
    Updated Jul 6, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Market Research Intellect (2025). High Affinity Nerve Growth Factor Receptor Sales Market Size, Share & Trends Analysis 2033 [Dataset]. https://www.marketresearchintellect.com/product/global-high-affinity-nerve-growth-factor-receptor-sales-market/
    Explore at:
    Dataset updated
    Jul 6, 2025
    Dataset authored and provided by
    Market Research Intellect
    License

    https://www.marketresearchintellect.com/privacy-policyhttps://www.marketresearchintellect.com/privacy-policy

    Area covered
    Global
    Description

    Dive into Market Research Intellect's High Affinity Nerve Growth Factor Receptor Sales Market Report, valued at USD 450 million in 2024, and forecast to reach USD 1.2 billion by 2033, growing at a CAGR of 12.3% from 2026 to 2033.

  19. Affinity 30 Import Data India – Buyers & Importers List

    • seair.co.in
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Affinity 30 Import Data India – Buyers & Importers List [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    India
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  20. Etq Import Data | Affinity Customs Inc

    • seair.co.in
    Updated Feb 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim (2025). Etq Import Data | Affinity Customs Inc [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset updated
    Feb 7, 2025
    Dataset provided by
    Seair Exim Solutions
    Authors
    Seair Exim
    Area covered
    United States
    Description

    Explore detailed Etq import data of Affinity Customs Inc in the USA—product details, price, quantity, origin countries, and US ports.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Aslan Ahmedov (2021). Market Basket Analysis [Dataset]. https://www.kaggle.com/datasets/aslanahmedov/market-basket-analysis
Organization logo

Market Basket Analysis

Analyzing Consumer Behaviour Using MBA Association Rule Mining

Explore at:
2 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 9, 2021
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Aslan Ahmedov
Description

Market Basket Analysis

Market basket analysis with Apriori algorithm

The retailer wants to target customers with suggestions on itemset that a customer is most likely to purchase .I was given dataset contains data of a retailer; the transaction data provides data around all the transactions that have happened over a period of time. Retailer will use result to grove in his industry and provide for customer suggestions on itemset, we be able increase customer engagement and improve customer experience and identify customer behavior. I will solve this problem with use Association Rules type of unsupervised learning technique that checks for the dependency of one data item on another data item.

Introduction

Association Rule is most used when you are planning to build association in different objects in a set. It works when you are planning to find frequent patterns in a transaction database. It can tell you what items do customers frequently buy together and it allows retailer to identify relationships between the items.

An Example of Association Rules

Assume there are 100 customers, 10 of them bought Computer Mouth, 9 bought Mat for Mouse and 8 bought both of them. - bought Computer Mouth => bought Mat for Mouse - support = P(Mouth & Mat) = 8/100 = 0.08 - confidence = support/P(Mat for Mouse) = 0.08/0.09 = 0.89 - lift = confidence/P(Computer Mouth) = 0.89/0.10 = 8.9 This just simple example. In practice, a rule needs the support of several hundred transactions, before it can be considered statistically significant, and datasets often contain thousands or millions of transactions.

Strategy

  • Data Import
  • Data Understanding and Exploration
  • Transformation of the data – so that is ready to be consumed by the association rules algorithm
  • Running association rules
  • Exploring the rules generated
  • Filtering the generated rules
  • Visualization of Rule

Dataset Description

  • File name: Assignment-1_Data
  • List name: retaildata
  • File format: . xlsx
  • Number of Row: 522065
  • Number of Attributes: 7

    • BillNo: 6-digit number assigned to each transaction. Nominal.
    • Itemname: Product name. Nominal.
    • Quantity: The quantities of each product per transaction. Numeric.
    • Date: The day and time when each transaction was generated. Numeric.
    • Price: Product price. Numeric.
    • CustomerID: 5-digit number assigned to each customer. Nominal.
    • Country: Name of the country where each customer resides. Nominal.

imagehttps://user-images.githubusercontent.com/91852182/145270162-fc53e5a3-4ad1-4d06-b0e0-228aabcf6b70.png">

Libraries in R

First, we need to load required libraries. Shortly I describe all libraries.

  • arules - Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules).
  • arulesViz - Extends package 'arules' with various visualization. techniques for association rules and item-sets. The package also includes several interactive visualizations for rule exploration.
  • tidyverse - The tidyverse is an opinionated collection of R packages designed for data science.
  • readxl - Read Excel Files in R.
  • plyr - Tools for Splitting, Applying and Combining Data.
  • ggplot2 - A system for 'declaratively' creating graphics, based on "The Grammar of Graphics". You provide the data, tell 'ggplot2' how to map variables to aesthetics, what graphical primitives to use, and it takes care of the details.
  • knitr - Dynamic Report generation in R.
  • magrittr- Provides a mechanism for chaining commands with a new forward-pipe operator, %>%. This operator will forward a value, or the result of an expression, into the next function call/expression. There is flexible support for the type of right-hand side expressions.
  • dplyr - A fast, consistent tool for working with data frame like objects, both in memory and out of memory.
  • tidyverse - This package is designed to make it easy to install and load multiple 'tidyverse' packages in a single step.

imagehttps://user-images.githubusercontent.com/91852182/145270210-49c8e1aa-9753-431b-a8d5-99601bc76cb5.png">

Data Pre-processing

Next, we need to upload Assignment-1_Data. xlsx to R to read the dataset.Now we can see our data in R.

imagehttps://user-images.githubusercontent.com/91852182/145270229-514f0983-3bbb-4cd3-be64-980e92656a02.png"> imagehttps://user-images.githubusercontent.com/91852182/145270251-6f6f6472-8817-435c-a995-9bc4bfef10d1.png">

After we will clear our data frame, will remove missing values.

imagehttps://user-images.githubusercontent.com/91852182/145270286-05854e1a-2b6c-490e-ab30-9e99e731eacb.png">

To apply Association Rule mining, we need to convert dataframe into transaction data to make all items that are bought together in one invoice will be in ...

Search
Clear search
Close search
Google apps
Main menu