100+ datasets found
  1. 80M Vector Image Dataset – AI Training & Commercial Use

    • nexdata.ai
    Updated Apr 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2025). 80M Vector Image Dataset – AI Training & Commercial Use [Dataset]. https://www.nexdata.ai/datasets/computervision/1794
    Explore at:
    Dataset updated
    Apr 7, 2025
    Dataset authored and provided by
    Nexdata
    Variables measured
    Data size, Image type, Data format, Data content
    Description

    This dataset contains 80 million high-quality vector images (SVG, EPS, AI formats), offering a vast collection for use in computer vision, machine learning, and creative applications. Each image is copyright-cleared and legally sourced through authorized channels, with transparent usage rights for both commercial and academic purposes. The dataset features a wide variety of vector content—icons, illustrations, infographics, and more—with excellent color fidelity and scalable resolution. Ideal for AI model training (e.g., image classification, object recognition), generative design models, and creative design inspiration, this resource ensures traceable IP rights and enables safe, large-scale usage in real-world environments.

  2. h

    VectorEdits

    • huggingface.co
    Updated Jun 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Author Anonymous (2025). VectorEdits [Dataset]. https://huggingface.co/datasets/authoranonymous321/VectorEdits
    Explore at:
    Dataset updated
    Jun 7, 2025
    Authors
    Author Anonymous
    Description

    VectorEdits: A Dataset and Benchmark for Instruction-Based Editing of Vector Graphics

    Paper (Soon) We introduce a large-scale dataset for instruction-guided vector image editing, consisting of over 270,000 pairs of SVG images paired with natural language edit instructions. Our dataset enables training and evaluation of models that modify vector graphics based on textual commands. We describe the data collection process, including image pairing via CLIP similarity and instruction… See the full description on the dataset page: https://huggingface.co/datasets/authoranonymous321/VectorEdits.

  3. Nexdata | Vector Image Data | 80 Million

    • datarade.ai
    • data.nexdata.ai
    Updated Nov 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2025). Nexdata | Vector Image Data | 80 Million [Dataset]. https://datarade.ai/data-products/nexdata-vector-image-data-80-million-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Nov 12, 2025
    Dataset authored and provided by
    Nexdata
    Area covered
    Costa Rica, Argentina, Qatar, Peru, Ukraine, Croatia, Malta, Russian Federation, Romania, Egypt
    Description

    This dataset comprises 80 million vector images. The resources are diverse in type, excellent color accuracy, and rich detail. All materials have been legally obtained through authorized channels, with clear indications of copyright ownership and usage authorization scope. The entire collection provides commercial-grade usage rights and has been granted permission for scientific research use, ensuring clear and traceable intellectual property attribution. The vast and high-quality image resources offer robust support for a wide range of applications, including research in the field of computer vision, training of image recognition algorithms, and sourcing materials for creative design, thereby facilitating efficient progress in related areas.

    Data size

    80 million images

    Image type

    posters, patterns, cartoons, backgrounds and other categories

    Data format

    image formats is .eps

    Data content

    genuine image works released by the author

  4. corel_images

    • kaggle.com
    zip
    Updated Jan 1, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Akil Elkamel (2020). corel_images [Dataset]. https://www.kaggle.com/elkamel/corel-images
    Explore at:
    zip(29867737 bytes)Available download formats
    Dataset updated
    Jan 1, 2020
    Authors
    Akil Elkamel
    Description

    Context

    • CNN
    • CBIR

    Content

    • A 10 concept groups of images composed each by 100 images.
    • For each concept group the images are divided into 90 images for training and 10 images for test.

    Acknowledgements

    1. 1. W. Bian and D. Tao, “Biased Discriminant Euclidean Embedding for Content based Image Retrieval,” IEEE Transactions on Image Processing (TIP), accept with minor revision.
    2. D. Tao, X. Li, and S. J. Maybank, “Negative Samples Analysis in Relevance Feedback,” IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 19, no. 4, pp. 568-580, April 2007.
    3. D. Tao, X. Tang, X. Li, and X. Wu, “Asymmetric Bagging and Random Subspace for Support Vector Machines-based Relevance Feedback in Image Retrieval,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 28, no.7, pp. 1088-1099, July 2006.
    4. J. Li, N. Allinsion, D. Tao, and X. Li, “Multitraining Support Vector Machine for Image Retrieval,” IEEE Transactions on Image Processing (TIP), vol. 15, no. 11, pp. 3597-3601, November 2006.
    5. D. Tao, X. Tang, X. Li, and Y. Rui, “Kernel Direct Biased Discriminant Analysis: A New Content-based Image Retrieval Relevance Feedback Algorithm,” IEEE Transactions on Multimedia (TMM), vol. 8, no. 4, pp. 716-727, August 2006.
  5. d

    Malaria vector mosquito images

    • search.dataone.org
    • datadryad.org
    Updated Jun 13, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jannelle Couret (2025). Malaria vector mosquito images [Dataset]. http://doi.org/10.5061/dryad.z08kprr92
    Explore at:
    Dataset updated
    Jun 13, 2025
    Dataset provided by
    Dryad Digital Repository
    Authors
    Jannelle Couret
    Time period covered
    Jan 1, 2020
    Description

    We created a novel database of mosquito images by sampling live mosquitoes from established colonies maintained by the Malaria Research and Reference Reagent Resource (MR4)/ Biodefense and Emerging Infections (BEI) Resources at the Centers for Disease Control and Prevention (CDC) in Atlanta, GA. Adults of both sexes were imaged from 15 species of mosquitoes from there genera, 13 Anopheles, 2 Culex and 1 Aedes. There are a total of 1,709 images. We included an additional strain of An. gambiae s.s. resulting in two categories of this species: G3 and KISUMU1. Finally, for An. stephensi we captured images of mosquitoes using the two methods of storing mosquitoes, freezing versus dried samples. Images are folders labeled by genus, species, strain, sex and storage method.

  6. s

    Citation Trends for "Feature-specific vector quantization of images"

    • shibatadb.com
    Updated Feb 15, 1996
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yubetsu (1996). Citation Trends for "Feature-specific vector quantization of images" [Dataset]. https://www.shibatadb.com/article/LzsRJfjG
    Explore at:
    Dataset updated
    Feb 15, 1996
    Dataset authored and provided by
    Yubetsu
    License

    https://www.shibatadb.com/license/data/proprietary/v1.0/license.txthttps://www.shibatadb.com/license/data/proprietary/v1.0/license.txt

    Time period covered
    1999 - 2015
    Variables measured
    New Citations per Year
    Description

    Yearly citation counts for the publication titled "Feature-specific vector quantization of images".

  7. d

    NSCAT Level 3 Daily Gridded Ocean Surface Wind Vector Browse Images (JPL)

    • catalog.data.gov
    • datasets.ai
    • +5more
    Updated Sep 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NASA/JPL/PODAAC (2025). NSCAT Level 3 Daily Gridded Ocean Surface Wind Vector Browse Images (JPL) [Dataset]. https://catalog.data.gov/dataset/nscat-level-3-daily-gridded-ocean-surface-wind-vector-browse-images-jpl-9ec53
    Explore at:
    Dataset updated
    Sep 18, 2025
    Dataset provided by
    NASA/JPL/PODAAC
    Description

    This dataset provides browse images of the NASA Scatterometer (NSCAT) Level 3 daily gridded ocean wind vectors, which are provided at 0.5 degree spatial resolution for ascending and descending passes; wind vectors are averaged at points where adjacent passes overlap. This is the most up-to-date version, which designates the final phase of calibration, validation and science data processing, which was completed in November of 1998, on behalf of the JPL NSCAT Project; wind vectors are processed using the NSCAT-2 geophysical model function. Information and access to the Level 3 source data used to generate these browse images may be accessed at: http://podaac.jpl.nasa.gov/dataset/NSCAT%20LEVEL%203.

  8. h

    tree-of-life-vector-db

    • huggingface.co
    Updated Oct 31, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HDR Imageomics Institute (2025). tree-of-life-vector-db [Dataset]. https://huggingface.co/datasets/imageomics/tree-of-life-vector-db
    Explore at:
    Dataset updated
    Oct 31, 2025
    Dataset authored and provided by
    HDR Imageomics Institute
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Dataset Card for TreeOfLife-10M Vector database

    Persistent files for vector Database created with chromadb containing the embeddings for all images in the imageomics/TreeOfLife-10M dataset.

      Dataset Details
    

    This dataset contains the generated vector database built using ChromaDb as the backend vector database solution for the entire TreeOfLife-10M dataset. The rationale behind creating a vector database was to enable blazingly fast nearest neighbor search. The vector… See the full description on the dataset page: https://huggingface.co/datasets/imageomics/tree-of-life-vector-db.

  9. Training a robot to understand sign language

    • kaggle.com
    zip
    Updated Nov 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Amitabha Banerjee (2019). Training a robot to understand sign language [Dataset]. https://www.kaggle.com/dsv/809494
    Explore at:
    zip(133773677 bytes)Available download formats
    Dataset updated
    Nov 24, 2019
    Authors
    Amitabha Banerjee
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset is a repositpry of sign language images taken by the Anki Vector robot. To understand the American sign language for the English alphabet, please take a look at the following video: https://www.youtube.com/watch?v=a5BD8SjhPSg

    Content

    The dataset contains roughly 8500 images. Images are labelled according to the sign language, for e.g. all images with a_*.png are labels for pictures with sign for the alphabet 'a' taken by vector. All images for the background (with no sign) are labelled as background_a.

    Acknowledgements

    Thanks to the entire ex-Anki team for working on a fantastic robot and making the SDK available free,

    Inspiration

    Lets train a model to enable robots to accurately understand the human sign language.

    More

    More material wrt this dataset is available in my online course: 'Learn AI with a robot', available at http://robotics.thinkific.com

  10. i

    Data Visualization SVG Illustrations Dataset

    • illuhub.com
    svg
    Updated Sep 7, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Illuhub (2025). Data Visualization SVG Illustrations Dataset [Dataset]. https://illuhub.com/illustrations/technology-electronics/data-visual
    Explore at:
    svgAvailable download formats
    Dataset updated
    Sep 7, 2025
    Dataset authored and provided by
    Illuhub
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2024 - Present
    Area covered
    Worldwide
    Variables measured
    Category, File Format, Subcategory
    Description

    Specialized collection of 0 free data visualization SVG illustrations from the technology & electronics category. Data visualization illustrations including bar charts, network graphs, and information graphics Examples include: bar chart, network graph.

  11. open-pmc

    • huggingface.co
    Updated Mar 25, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vector Institute (2025). open-pmc [Dataset]. https://huggingface.co/datasets/vector-institute/open-pmc
    Explore at:
    Dataset updated
    Mar 25, 2025
    Dataset authored and provided by
    Vector Institutehttps://www.vectorinstitute.ai/
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    OPEN-PMC

    Arxiv: Arxiv |
    Code: Open-PMC Github |
    Model Checkpoint: Hugging Face

      Dataset Summary
    

    This dataset consists of image-text pairs extracted from medical papers available on PubMed Central. It has been curated to support research in medical image understanding, particularly in natural language processing (NLP) and computer vision tasks related to medical imagery. The dataset includes:

    Extracted images from research articles.… See the full description on the dataset page: https://huggingface.co/datasets/vector-institute/open-pmc.

  12. The 14 datasets used to build SVM and LS-SVM classification models of FLS.

    • plos.figshare.com
    xls
    Updated Jun 9, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shuang Liu; Haiye Yu; Yuanyuan Sui; Haigen Zhou; Junhe Zhang; Lijuan Kong; Jingmin Dang; Lei Zhang (2023). The 14 datasets used to build SVM and LS-SVM classification models of FLS. [Dataset]. http://doi.org/10.1371/journal.pone.0257008.t004
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 9, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Shuang Liu; Haiye Yu; Yuanyuan Sui; Haigen Zhou; Junhe Zhang; Lijuan Kong; Jingmin Dang; Lei Zhang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The 14 datasets used to build SVM and LS-SVM classification models of FLS.

  13. Graphics software market value: vector graphics 2009-2013

    • statista.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista, Graphics software market value: vector graphics 2009-2013 [Dataset]. https://www.statista.com/statistics/269251/computer-graphics-software-market-value-in-the-vector-graphics-segment/
    Explore at:
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2009
    Area covered
    Worldwide
    Description

    The statistic shows the computer graphics software market value in the vector graphics segment from 2009 to 2013. In 2010, there was a market value of *** million U.S. dollars.

  14. d

    Particle Image Velocimetry Results

    • catalog.data.gov
    • s.cnmilf.com
    Updated Oct 29, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Particle Image Velocimetry Results [Dataset]. https://catalog.data.gov/dataset/particle-image-velocimetry-results
    Explore at:
    Dataset updated
    Oct 29, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    This child item contains the Mathworks Matlab mat-file outputs from the scripts described in the Ancillary Scripts child item. Each file contains the results for a particular field site. See the FGDC metadata Process Steps section for more information about opening these files. The mat-files included here have a standard set of output variables and include a variable named "zzVariableDescriptions" in each mat-file which describes the contents of the file. The following variables and descriptions are included in each mat-file (extracted from the "zzVariableDescriptions" variable):

    • calibration_distance: The distance between calibration points in meters.
    • calibration_points: Pixel coordinates of the calibration points. Format (array): [X1,Y1; X2; Y2]
    • calibration_time: Time increment between image frames in milliseconds.
    • caluv: Correction factor used to convert pixel/second into meters/second.
    • calxy: Pixel ground resolution in meters/pixel.
    • directory: Path to folder containing images used in PIV analysis.
    • filenames: Cell array of strings containing image frame filenames. Format (cellarray): 1m (m: number of frames)
    • imagesLocation: Path to folder containing images used in PIV analysis.
    • i: Dimensions of PIV results. Format: inumber of rows along y-axis
    • j: Dimensions of PIV results. Format: jnumber of columns along x-axis
    • k: Dimensions of PIV results. Format: knumber ofimages or frames in time (numbmer of images processed)
    • p: PIVLab image pre-processing settings. See PIVLab documentation for information.
    • pixel_resolution: Pixel ground resolution in meters. Assumes square pixels.
    • r: PIVLab post-processing settings. See PIVLab documentation for information.
    • resultsFileFullPath: Path to folder containing PIV results in mat-file format.
    • s: PIVLab standard processing settings. See PIVLab documentation for information.
    • typevector: Array (mnp) containing raw vector result type of frame (mn) for each frame (p). Format: type 1-valid PIV vector; type 0-masked vector; type 2-invalid PIV vector
    • typevector_filt: Array (mnp) containing filtered vector result type of frame (mn) for each frame (p). Format: type 1-valid PIV vector; type 0-masked vector; type 2-invalid PIV vector
    • u_mean: Array (mn) containing the temporal average u component of velocity in meters/second. Values are averaged for every vector for each frame (along p dimension).
    • u_stack: Array (mnp) containing filtered u component velocities for each vector (mn) for each frame (p).
    • v_mean: Array (mn) containing the temporal average v component of velocity in meters/second. Values are averaged for every vector for each frame (along p dimension).
    • v_stack: Array (mnp) containing filtered v component velocities for each vector (mn) for each frame (p).
    • x_ground: Array (mn) containing the x (horizontal) ground coordinate in meters for each PIV result vector. Origin of coordinates is the lower left corner.
    • x_pixel: Array (mn) containing the x (horizontal) pixel coordinate for each PIV result vector.
    • y_ground: Array (mn) containing the y (horizontal) ground coordinate in meters for each PIV result vector. Origin of coordinates is the lower left corner.
    • y_pixel: Array (mn) containing the y (horizontal) pixel coordinate for each PIV result vector.
    • zzVariableDescriptions: A structured array containing elements named after each variable in this dataset.

    Each Field Site is abbreviated in various files in this data release. File and folder names are used to quickly identify which site a particular file or dataset represents. The following abbreviations are used:
    • ACR: Androscoggin River, Auburn, Maine, USA
    • AFR: Agua Fria River, near Rock Springs, Arizona, USA
    • CCC: Coachella Canal above All-American Canal Diversion, California, USA
    • CMC: Cochiti East Side Main Channel, near Cochiti, New Mexico, USA
    • GLR: Gila River near Dome, Arizona, USA
    • RMC: Reservation Main Canal near Yuma, Arizona, USA
    • SMC: Sile Main Canal (at head) at Cochiti, New Mexico, USA
    • WMD: Wellton-Mohawk Main Outlet Drain near Yuma, Arizona, USA

  15. m

    Data from: CQ100: A High-Quality Image Dataset for Color Quantization...

    • data.mendeley.com
    Updated Dec 17, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    M. Emre Celebi (2024). CQ100: A High-Quality Image Dataset for Color Quantization Research [Dataset]. http://doi.org/10.17632/vw5ys9hfxw.4
    Explore at:
    Dataset updated
    Dec 17, 2024
    Authors
    M. Emre Celebi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    CQ100 is a diverse and high-quality dataset of color images that can be used to develop, test, and compare color quantization algorithms. The dataset can also be used in other color image processing tasks, including filtering and segmentation.

    If you find CQ100 useful, please cite the following publication: M. E. Celebi and M. L. Perez-Delgado, “CQ100: A High-Quality Image Dataset for Color Quantization Research,” Journal of Electronic Imaging, vol. 32, no. 3, 033019, 2023.

    You may download the above publication free of charge from: https://www.spiedigitallibrary.org/journals/journal-of-electronic-imaging/volume-32/issue-3/033019/cq100--a-high-quality-image-dataset-for-color-quantization/10.1117/1.JEI.32.3.033019.full?SSO=1

  16. Parsing four vectors on one MSH

    • kaggle.com
    zip
    Updated Aug 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Глеб Мехряков (2024). Parsing four vectors on one MSH [Dataset]. https://www.kaggle.com/datasets/mephistophel2312/parsing-four-vectors-on-one-msh/data
    Explore at:
    zip(44366 bytes)Available download formats
    Dataset updated
    Aug 26, 2024
    Authors
    Глеб Мехряков
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Глеб Мехряков

    Released under MIT

    Contents

  17. G

    Vector Database Market Research Report 2033

    • growthmarketreports.com
    csv, pdf, pptx
    Updated Sep 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Growth Market Reports (2025). Vector Database Market Research Report 2033 [Dataset]. https://growthmarketreports.com/report/vector-database-market
    Explore at:
    csv, pdf, pptxAvailable download formats
    Dataset updated
    Sep 1, 2025
    Dataset authored and provided by
    Growth Market Reports
    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Vector Database Market Outlook



    According to our latest research, the global vector database market size reached USD 1.12 billion in 2024, demonstrating robust momentum driven by the surging adoption of artificial intelligence and machine learning applications. The market is experiencing a remarkable expansion, registering a CAGR of 22.4% from 2025 to 2033. By 2033, the market is forecasted to reach USD 8.43 billion, underscoring the transformative role of vector databases in powering next-generation data-driven solutions. This extraordinary growth trajectory is fueled by the increasing need for high-performance search and analytics capabilities across industries, as organizations pivot towards leveraging unstructured and semi-structured data for strategic advantage.



    A primary growth factor for the vector database market is the exponential increase in the volume and complexity of unstructured data generated by enterprises. As organizations accumulate vast amounts of images, videos, text, and other rich media, traditional relational databases struggle to provide the speed and scalability required for real-time analysis and retrieval. Vector databases, designed specifically to handle high-dimensional vector representations, have become essential for enabling advanced search and recommendation systems. The proliferation of AI-powered applications, such as semantic search, natural language processing, and image recognition, is amplifying the demand for vector databases, as these systems rely on vector embeddings to deliver accurate and contextually relevant results. Furthermore, the integration of vector databases with popular machine learning frameworks is streamlining the development and deployment of intelligent solutions, accelerating market adoption.



    Another significant driver is the rapid digital transformation across key verticals, including BFSI, healthcare, retail and e-commerce, and IT and telecommunications. Enterprises in these sectors are increasingly leveraging vector databases to enhance customer experiences, improve operational efficiency, and unlock new revenue streams. For instance, in retail and e-commerce, vector databases power personalized recommendation engines and visual search capabilities, driving higher conversion rates and customer satisfaction. In healthcare, they enable advanced medical image analysis and patient data retrieval, supporting better diagnostics and treatment outcomes. The growing emphasis on data-driven decision-making and the need to derive actionable insights from complex datasets are compelling organizations to invest in vector database technologies, further propelling market growth.



    The evolution of deployment models and the rise of cloud-native architectures have also contributed to the expansion of the vector database market. Organizations are increasingly opting for cloud-based vector database solutions to benefit from scalability, flexibility, and cost efficiency. Cloud deployment enables seamless integration with existing IT infrastructure and allows enterprises to scale resources dynamically based on workload demands. This shift is particularly pronounced among small and medium enterprises (SMEs), which often lack the capital and expertise to maintain on-premises infrastructure. The availability of managed vector database services from major cloud providers is lowering the barrier to entry, democratizing access to advanced data management capabilities, and fueling widespread adoption across diverse industry segments.



    The financial services sector is increasingly recognizing the transformative potential of vector search technology. Vector Search for Financial Services is revolutionizing how institutions manage and analyze vast datasets, enabling more accurate risk assessments and personalized customer interactions. By leveraging high-dimensional vector representations, financial organizations can enhance fraud detection, streamline compliance processes, and deliver tailored financial products. This technology is particularly beneficial in real-time trading environments, where rapid data retrieval and analysis are crucial. As the financial industry continues to evolve, the adoption of vector search solutions is set to accelerate, driving innovation and competitive advantage in a data-driven landscape.



    From a regional perspective, North America continues to dominate the vector database market, driven by the p

  18. Aedes Mosquito Image Dataset Version 1.0(AMID v1)

    • kaggle.com
    zip
    Updated Dec 2, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tonmoy Chandro Saha (2024). Aedes Mosquito Image Dataset Version 1.0(AMID v1) [Dataset]. https://www.kaggle.com/datasets/tonmoy406/aedes-mosquito-image-dataset-version-1-0amid-v1
    Explore at:
    zip(268117211 bytes)Available download formats
    Dataset updated
    Dec 2, 2024
    Authors
    Tonmoy Chandro Saha
    License

    Attribution-NonCommercial-NoDerivs 4.0 (CC BY-NC-ND 4.0)https://creativecommons.org/licenses/by-nc-nd/4.0/
    License information was derived automatically

    Description

    Aedes Mosquito Image Dataset Version 1.0 (AMID v1.0)

    Introduction

    The outbreak of dengue fever in recent years has become a grave public health concern as it has spread to 20 countries in South America and South Asia. As vectors of the flavivirus, several mosquito species belonging to the Aedes genus are responsible for transmitting dengue fever. Effective vector surveillance and control are essential in reducing dengue outbreaks. However, due to their minute variations in anatomical structure, it is challenging to identify Aedes mosquitoes without expert entomologists using a microscope. In this regard, deep learning algorithms can play a vital role in identifying mosquitoes using smartphone-captured images and pave the way for deskilling automated vector surveillance, provided that sufficient training examples are available.

    A graphical representation of our working pipeline: https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15444916%2F19a5be5f62e8c13d80e4eeb60f83b5c9%2FA%20flow-diagram%20of%20the%20proposed%20mosquito%20detection%20system.png?generation=1718534243741585&alt=media" alt="">

    Contents

    In this study, we developed the “Aedes Mosquito Image Dataset,” consisting of smartphone-captured mosquito images consisting of 8 class labels: Aedes aegypti, Aedes koreicus, Aedes albopictus, Culex pipiens, Armigeres subalbatus, Culex quinquifasciatus, Aedes japonicus, and others (non-mosquito). The images are collected by trapping mosquitoes in several locations in Dhaka, followed by image capture and expert annotations in collaboration with ICDDR,B. Additional image data is collected from open-access online repositories.

    Some sample images from dataset: https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F15444916%2F95a785845e1d50ee6fc90683c1dbe58b%2FDifferent%20types%20of%20Mosquito%20Species.png?generation=1718537439037243&alt=media" alt="">

    Dataset Description

    The dataset contains a total of 31999 images from 3 sources. The class distribution is presented as follows: | Class label | No. of Images from Mosquito Alert | No. of Images from ICDDR,B | No. of Images from WHO | Total No. of Images (Class label Wise) | |---------------------------------|-----------------------------------|----------------------------|-------------------------|----------------------------------------| | Aedes aegypti | 73 | 247 | 499 | 819 | | Aedes albopictus | 15,268 | 7 | 500 | 15,775 | | Aedes japonicus | 153 | 0 | 0 | 153 | | Aedes koreicus | 38 | 0 | 0 | 38 | | Armigeres subalbatus | 0 | 42 | 0 | 42 | | Culex pipiens | 6,180 | 231 | 0 | 6,411 | | Culex quinquefasciatus | 0 | 0 | 500 | 500 | | Others (non-mosquito) | 8,261 | 0 | 0 | 8,261 | | Total No. of Images (Source Wise) | 29,973 | 527 | 1,499 | 31,999 |

    This dataset has 8 subfolders, which contain 7 kinds of mosquito images and 1 folder 1 folder (Others) for non-mosquito images.

    Source

    The dataset was constructed by sourcing images from Mosquito Alert, WHO accredited breeding laboratory & Trap set up by ICDDR,B. Each image was meticulously recorded along with its respective source information to facilitate further verification and ensure proper attribution. Copyright considerations were duly addressed, adhering to appropriate protocols to safeguard intellectual property rights.

    Naming Convention of the Images

    Each image is assigned a name following the format of SourceCode_ClassLabel_Cropped_CroppingNumber_Resized. The corresponding source codes assigned to each source are: Mosquito Alert -> MSA; ICDDR,B -> ICD; WHO accredited bre...

  19. Nexdata | High-quality Image Caption Data | 300 million pairs

    • datarade.ai
    Updated Nov 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2025). Nexdata | High-quality Image Caption Data | 300 million pairs [Dataset]. https://datarade.ai/data-products/nexdata-high-quality-image-caption-data-300-million-pairs-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Nov 12, 2025
    Dataset authored and provided by
    Nexdata
    Area covered
    United Kingdom, Portugal, Taiwan, Hong Kong, Austria, Japan, Turkey, Czech Republic, India, Lithuania
    Description

    300 million images, each corresponding to a description. All are genuine image works published by photographers. The vast majority of descriptions are in English, with very few in Chinese.

    Data size

    300 million images, each paired with a textual description. Complete image library (including photographic + vector images) totals nearly 300 million, Full dataset available for generative AI training (curated photographic + vector images excluding editorial/news images) comprises approximately 100 million.

    Data formats

    Image formats: .jpg, .png, .svg; Description format: .txt

    Data content

    Original copyrighted image works officially released by creators, accompanying descriptions authored by content creators.

    Data types

    Photographic images and vector illustrations, covers diverse scene categories.

    Data resolution

    4K and above

    Description languages

    Predominantly English (majority), Minimal Chinese portion.

  20. i

    Presentations SVG Illustrations Dataset

    • illuhub.com
    svg
    Updated Sep 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Illuhub (2025). Presentations SVG Illustrations Dataset [Dataset]. https://illuhub.com/illustrations/office-workplace/presentations
    Explore at:
    svgAvailable download formats
    Dataset updated
    Sep 24, 2025
    Dataset authored and provided by
    Illuhub
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2024 - Present
    Area covered
    Worldwide
    Variables measured
    Category, File Format, Subcategory
    Description

    Specialized collection of 0 free presentations SVG illustrations from the office & workplace category. Presentation scene illustrations with speakers at podiums, slide deck presentations, and product demonstrations Examples include: speaker at podium, slide deck on screen, product demo.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Nexdata (2025). 80M Vector Image Dataset – AI Training & Commercial Use [Dataset]. https://www.nexdata.ai/datasets/computervision/1794
Organization logo

80M Vector Image Dataset – AI Training & Commercial Use

Explore at:
Dataset updated
Apr 7, 2025
Dataset authored and provided by
Nexdata
Variables measured
Data size, Image type, Data format, Data content
Description

This dataset contains 80 million high-quality vector images (SVG, EPS, AI formats), offering a vast collection for use in computer vision, machine learning, and creative applications. Each image is copyright-cleared and legally sourced through authorized channels, with transparent usage rights for both commercial and academic purposes. The dataset features a wide variety of vector content—icons, illustrations, infographics, and more—with excellent color fidelity and scalable resolution. Ideal for AI model training (e.g., image classification, object recognition), generative design models, and creative design inspiration, this resource ensures traceable IP rights and enables safe, large-scale usage in real-world environments.

Search
Clear search
Close search
Google apps
Main menu