100+ datasets found
  1. Human Activity Recognition (HAR - Video Dataset)

    • kaggle.com
    Updated May 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sharjeel M. (2023). Human Activity Recognition (HAR - Video Dataset) [Dataset]. http://doi.org/10.34740/kaggle/dsv/5722068
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 19, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sharjeel M.
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    The dataset contains a comprehensive collection of human activity videos, spanning across 7 distinct classes. These classes include clapping, meeting and splitting, sitting, standing still, walking, walking while reading book, and walking while using the phone.

    Each video clip in the dataset showcases a specific human activity and has been labeled with the corresponding class to facilitate supervised learning.

    The primary inspiration behind creating this dataset is to enable machines to recognize and classify human activities accurately. With the advent of computer vision and deep learning techniques, it has become increasingly important to train machine learning models on large and diverse datasets to improve their accuracy and robustness.

  2. m

    Flood Amateur Video for Semantic Segmentation Dataset

    • data.mendeley.com
    Updated May 16, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Naili Suri Intizhami (2024). Flood Amateur Video for Semantic Segmentation Dataset [Dataset]. http://doi.org/10.17632/3kzr8mt8s2.5
    Explore at:
    Dataset updated
    May 16, 2024
    Authors
    Naili Suri Intizhami
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset is flood data in the city of Parepare, South Sulawesi Province, which contains video data collected from social media Instagram. This dataset was created to develop deep learning methods for recognizing floods and surrounding objects, specializing in semantic segmentation methods. This dataset consists of three folders, namely raw video data collected from Instagram, image data resulting from splitting the video into several images, and annotation data containing images that have been color-labeled according to their objects. There are 6 object classifications based on color labels, namely: floods (blue light), buildings (red), plants (green), people (sage), vehicles (orange), and sky (dark blue). This dataset has data in image (JPEG/PNG) and video (MP4) formats. This dataset is suitable for object recognition tasks with the semantic segmentation method. In addition, because this dataset contains original data in the form of videos and images, it can be developed for other purposes in the future. As a note, if you intend to use this dataset, please ensure that you comply with applicable copyright, privacy, and regulatory requirements. If you intend to read the paper about this dataset, please visit this link: https://doi.org/10.1016/j.dib.2023.109768

  3. h

    video-dataset

    • huggingface.co
    Updated May 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ProgramerSalar (2025). video-dataset [Dataset]. https://huggingface.co/datasets/ProgramerSalar/video-dataset
    Explore at:
    Dataset updated
    May 11, 2025
    Authors
    ProgramerSalar
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Video Dataset on Hugging Face

    This repository hosts the video dataset, a widely used benchmark dataset for human action recognition in videos. The dataset has been processed and uploaded to the Hugging Face Hub for easy access, sharing, and integration into machine learning workflows.

      Introduction
    

    The dataset is a large-scale video dataset designed for action recognition tasks. It contains 13,320 video clips across 101 action categories, making it one of the most… See the full description on the dataset page: https://huggingface.co/datasets/ProgramerSalar/video-dataset.

  4. i

    Sintel 4D Light Field Video Dataset

    • ieee-dataport.org
    Updated Mar 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Takahiro Kinoshita (2021). Sintel 4D Light Field Video Dataset [Dataset]. https://ieee-dataport.org/open-access/sintel-4d-light-field-video-dataset
    Explore at:
    Dataset updated
    Mar 26, 2021
    Authors
    Takahiro Kinoshita
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    9x9 views

  5. m

    Video Dataset of Interview for training AI/ML Models

    • data.macgence.com
    mp3
    Updated May 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Macgence (2024). Video Dataset of Interview for training AI/ML Models [Dataset]. https://data.macgence.com/dataset/video-dataset-of-interview-for-training-aiml-models
    Explore at:
    mp3Available download formats
    Dataset updated
    May 29, 2024
    Dataset authored and provided by
    Macgence
    License

    https://data.macgence.com/terms-and-conditionshttps://data.macgence.com/terms-and-conditions

    Time period covered
    2025
    Area covered
    Worldwide
    Variables measured
    Outcome, Call Type, Transcriptions, Audio Recordings, Speaker Metadata, Conversation Topics
    Description

    Comprehensive video dataset of interviews designed to train AI/ML models. Enhance machine learning with diverse, high-quality, and realistic interview scenarios.

  6. u

    Video Emotion Recognition Dataset

    • unidata.pro
    json, mp4
    Updated Mar 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Unidata L.L.C-FZ (2025). Video Emotion Recognition Dataset [Dataset]. https://unidata.pro/datasets/video-emotion-recognition-dataset/
    Explore at:
    json, mp4Available download formats
    Dataset updated
    Mar 19, 2025
    Dataset authored and provided by
    Unidata L.L.C-FZ
    Description

    Video dataset capturing diverse facial expressions and emotions from 1000+ people, suitable for emotion recognition AI training

  7. P

    THVD Dataset

    • library.toponeai.link
    • paperswithcode.com
    Updated Apr 29, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). THVD Dataset [Dataset]. https://library.toponeai.link/dataset/thvd
    Explore at:
    Dataset updated
    Apr 29, 2025
    Description

    About

    We provide a comprehensive talking-head video dataset with over 50,000 videos, totaling more than 500+ hours of footage and featuring 20,841 unique identities from around the world.

    Distribution

    Detailing the format, size, and structure of the dataset:

    -Total Size: 2.7TB

    -Total Videos: 47,547

    -Identities Covered: 20,841

    -Resolution: 60% 4k(1980), 33% fullHD(1080)

    -Formats: MP4

    -Full-length videos with visible mouth movements in every frame.

    -Minimum face size of 400 pixels.

    -Video durations range from 20 seconds to 5 minutes.

    -Faces have not been cut out, full screen videos including backgrounds.

    Usage

    This dataset is ideal for a variety of applications:

    Face Recognition & Verification: Training and benchmarking facial recognition models.

    Action Recognition: Identifying human activities and behaviors.

    Re-Identification (Re-ID): Tracking identities across different videos and environments.

    Deepfake Detection: Developing methods to detect manipulated videos.

    Generative AI: Training high-resolution video generation models.

    Lip Syncing Applications: Enhancing AI-driven lip-syncing models for dubbing and virtual avatars.

    Background AI Applications: Developing AI models for automated background replacement, segmentation, and enhancement.

    Coverage

    Explaining the scope and coverage of the dataset:

    Geographic Coverage: Worldwide

    Time Range: Time range and size of the videos have been noted in the CSV file.

    Demographics: Includes information about age, gender, ethnicity, format, resolution, and file size.

    Languages Covered (Videos):

    English: 23,038 videos

    Portuguese: 1,346 videos

    Spanish: 677 videos

    Norwegian: 1,266 videos

    Swedish: 1,056 videos

    Korean: 848 videos

    Polish: 1,807 videos

    Indonesian: 1,163 videos

    French: 1,102 videos

    German: 1,276 videos

    Japanese: 1,433 videos

    Dutch: 1,666 videos

    Indian: 1,163 videos

    Czech: 590 videos

    Chinese: 685 videos

    Italian: 975 videos

    Philipeans: 920 videos

    Bulgaria: 340 videos

    Romanian: 1144 videos

    Arabic: 1691 videos

    Who Can Use It

    List examples of intended users and their use cases:

    Data Scientists: Training machine learning models for video-based AI applications.

    Researchers: Studying human behavior, facial analysis, or video AI advancements.

    Businesses: Developing facial recognition systems, video analytics, or AI-driven media applications.

    Additional Notes

    Ensure ethical usage and compliance with privacy regulations. The dataset’s quality and scale make it valuable for high-performance AI training. Potential preprocessing (cropping, down sampling) may be needed for different use cases. Dataset has not been completed yet and expands daily, please contact for most up to date CSV file. The dataset has been divided into 100GB zipped files and is hosted on a private server (with the option to upload to the cloud if needed). To verify the dataset's quality, please contact me for the full CSV file. I’d be happy to provide example videos selected by the potential buyer.

    https://lipsynthesis.com/dataset

  8. H

    Data from: VID: A Comprehensive Dataset for Violence Detection in Various...

    • dataverse.harvard.edu
    Updated Jul 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abu Bakar Siddique Mahi; Farhana Sultana Eshita; Tabassum Chowdhury; Rashik Rahman; Tanjina Helaly (2024). VID: A Comprehensive Dataset for Violence Detection in Various Contexts [Dataset]. http://doi.org/10.7910/DVN/N4LNZD
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 15, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Abu Bakar Siddique Mahi; Farhana Sultana Eshita; Tabassum Chowdhury; Rashik Rahman; Tanjina Helaly
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    To address the limitations of current datasets used for training automated crime and violence detection systems, we have created a new, balanced dataset consisting of 3,000 video clips. The dataset, which includes an equal number of violent and non-violent real-world scenarios recorded by non-professional actors, provides a more comprehensive and representative source for the development and assessment of these systems. Security and law enforcement professionals can use this comprehensive approach to analyze surveillance footage and identify pertinent incidents more efficiently.

  9. YouTube Video Popularity Prediction Dataset

    • kaggle.com
    Updated Apr 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Şahide ŞEKER (2025). YouTube Video Popularity Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/sahideseker/youtube-video-popularity-prediction-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Şahide ŞEKER
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Area covered
    YouTube
    Description

    🇬🇧 English:

    This synthetic dataset is designed for predicting the popularity of YouTube videos using metadata. It includes fields like video title, duration, tags, and view count. Useful for regression modeling, feature engineering, and exploring social media analytics.

    Use this dataset to:

    • Build regression models to estimate video views.
    • Explore the impact of title length, tags, and duration on popularity.
    • Practice real-world machine learning tasks without using actual video data.

    Features:

    • video_id: Unique identifier for the video
    • title_length: Number of characters in the title
    • tags_count: Number of tags associated with the video
    • duration_sec: Duration of the video in seconds
    • views: Number of views (target variable)

    🇹🇷 Türkçe:

    Bu sentetik veri seti, YouTube videolarının popülerliğini (izlenme sayısını) tahmin etmek amacıyla oluşturulmuştur. Başlık uzunluğu, etiket sayısı ve video süresi gibi meta verileri içermektedir. Sosyal medya analizi ve regresyon modeli geliştirmek isteyenler için uygundur.

    Bu veri seti sayesinde:

    • Video izlenme sayısını tahmin eden regresyon modelleri geliştirilebilir.
    • Başlık uzunluğu, etiket sayısı ve sürenin popülerlik üzerindeki etkisi incelenebilir.
    • Gerçek video verisi kullanmadan makine öğrenmesi uygulamaları yapılabilir.

    Değişkenler:

    • video_id: Video için benzersiz kimlik
    • title_length: Başlık uzunluğu (karakter sayısı)
    • tags_count: Etiket sayısı
    • duration_sec: Süre (saniye cinsinden)
    • views: İzlenme sayısı (hedef değişken)
  10. d

    SAW-IT-Plus Video Dataset

    • dro.deakin.edu.au
    • researchdata.edu.au
    txt
    Updated Mar 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Don Driscoll; Thi Thu Thuy Nguyen; Anne Eichholtzer (2023). SAW-IT-Plus Video Dataset [Dataset]. http://doi.org/10.26187/deakin.22359847.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Mar 31, 2023
    Dataset provided by
    Deakin University
    Authors
    Don Driscoll; Thi Thu Thuy Nguyen; Anne Eichholtzer
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    The SAW-IT-Plus dataset contains 11,458 videos collected in the wild, and 22 homemade videos (snake category). Videos are arranged in 8 main categories of animals (frogs - 0, snakes - 1, lizards - 2, birds - 3, small mammals < 2kg - 4, medium or large mammals > 2kg - 5, spiders - 7 and scorpions - 8). Echidnas - originally category 6 – were merged with big mammals. Some videos of crustacea and other reptiles are available but not classified. Empty videos (7,896) were added to allow for further testing of the algorithm. They are separated in 3 categories (details in Table 1).

    CSV files are available to detail the species for frogs, lizards, birds and small mammals for each video. Because the videos were mainly collected from real-world data; the number of videos for each animal category are unbalanced (Table 1). This folder also contains training images used to automatically detect videos containing animals in our overall dataset. More information available in the ReadMe files.

    The dataset was collected in Victoria, Australia, from February to October 2021 as part of the ERP22 (formerly ARI-PPD 05) grant.

  11. Real Time Anomaly Detection in CCTV Surveillance

    • kaggle.com
    Updated Apr 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    webadvisor (2023). Real Time Anomaly Detection in CCTV Surveillance [Dataset]. https://www.kaggle.com/datasets/webadvisor/real-time-anomaly-detection-in-cctv-surveillance
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 28, 2023
    Dataset provided by
    Kaggle
    Authors
    webadvisor
    Description

    UCF Crime Dataset in the most suitable structure. Contains 1900 videos from 13 different categories. To ensure the quality of this dataset, it is trained ten annotators (having different levels of computer vision expertise) to collect the dataset. Using videos search on YouTube and LiveLeak using text search queries (with slight variations e.g. “car crash”, “road accident”) of each anomaly.

  12. d

    Face Anti-spoofing Data | 200,000 ID | iBeta Dataset| Liveness Detection...

    • datarade.ai
    Updated Dec 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nexdata (2023). Face Anti-spoofing Data | 200,000 ID | iBeta Dataset| Liveness Detection Data| Image/Video Machine Learning (ML) Data| AI Datasets [Dataset]. https://datarade.ai/data-products/nexdata-face-anti-spoofing-data-200-000-id-image-video-nexdata
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Dec 21, 2023
    Dataset authored and provided by
    Nexdata
    Area covered
    Slovakia, Colombia, Hungary, Dominican Republic, Malta, Tunisia, South Africa, United Kingdom, Pakistan, Iraq
    Description
    1. Specifications Data size : 200,000 ID

    Population distribution : race distribution: Asians, Caucasians, black people; gender distribution: gender balance; age distribution: from child to the elderly, the young people and the middle aged are the majorities

    Collection environment : indoor scenes, outdoor scenes

    Collection diversity : various postures, expressions, light condition, scenes, time periods and distances

    Collection device : iPhone, android phone, iPad

    Collection time : daytime,night

    Image Parameter : the video format is .mov or .mp4, the image format is .jpg

    Accuracy : the accuracy of actions exceeds 97%

    1. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go machine learning (ML) data supports instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/computervision?source=Datarade
  13. P

    EDUVSUM Dataset

    • paperswithcode.com
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Junaid Ahmed Ghauri; Sherzod Hakimov; Ralph Ewerth, EDUVSUM Dataset [Dataset]. https://paperswithcode.com/dataset/eduvsum
    Explore at:
    Authors
    Junaid Ahmed Ghauri; Sherzod Hakimov; Ralph Ewerth
    Description

    EDUVSUM contains educational videos with subtitles from three popular e-learning platforms: Edx,YouTube, and TIB AV-Portal that cover the following topics: crash course on history of science and engineering, computer science, python and web programming, machine learning and computer vision, Internet of things (IoT), and software engineering. In total, the current version of the dataset contains 98 videos with ground truth values annotated by a user with an academic background in computer science.

  14. w

    Open Frames Dataset Repository

    • open-frames.web.app
    zip
    Updated Jun 9, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Open Frames (2025). Open Frames Dataset Repository [Dataset]. https://open-frames.web.app/
    Explore at:
    zip(17.6 GB), zip(9.1 GB), zip(23.5 GB), zip(41.2 GB), zip(72.5 GB)Available download formats
    Dataset updated
    Jun 9, 2025
    Dataset authored and provided by
    Open Frames
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    2024 - 2025
    Area covered
    Global
    Description

    Comprehensive collection of high-quality image and video datasets for computer vision, AI training, and machine learning research.

  15. i

    Video Shot Occlusion Detection DataSet

    • ieee-dataport.org
    Updated Jul 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Junhua Liao (2023). Video Shot Occlusion Detection DataSet [Dataset]. https://ieee-dataport.org/documents/video-shot-occlusion-detection-dataset
    Explore at:
    Dataset updated
    Jul 12, 2023
    Authors
    Junhua Liao
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    namely VSOD

  16. P

    ShareGPT4Video Dataset

    • paperswithcode.com
    Updated Sep 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lin Chen; Xilin Wei; Jinsong Li; Xiaoyi Dong; Pan Zhang; Yuhang Zang; Zehui Chen; Haodong Duan; Bin Lin; Zhenyu Tang; Li Yuan; Yu Qiao; Dahua Lin; Feng Zhao; Jiaqi Wang (2024). ShareGPT4Video Dataset [Dataset]. https://paperswithcode.com/dataset/sharegpt4video
    Explore at:
    Dataset updated
    Sep 3, 2024
    Authors
    Lin Chen; Xilin Wei; Jinsong Li; Xiaoyi Dong; Pan Zhang; Yuhang Zang; Zehui Chen; Haodong Duan; Bin Lin; Zhenyu Tang; Li Yuan; Yu Qiao; Dahua Lin; Feng Zhao; Jiaqi Wang
    Description

    The ShareGPT4Video dataset is a large-scale resource designed to improve video understanding and generation¹. It features 1.2 million highly descriptive captions⁴ for video clips, surpassing existing datasets in diversity and information content⁴. The captions cover a wide range of aspects, including world knowledge, object properties, spatial relationships, and aesthetic evaluations⁴.

    The dataset includes detailed captions of 40K videos generated by GPT-4V¹ and 4.8M videos generated by ShareCaptioner-Video¹. The videos are sourced from YouTube and other user-uploaded video websites, and they cover a variety of scenarios, such as human activities and auto-driving¹.

    The ShareGPT4Video dataset also provides a basis for the ShareCaptioner-Video, an exceptional video captioner capable of efficiently generating high-quality captions for videos of a wide range of resolution, aspect ratio, and duration¹.

    For example, the dataset includes a detailed caption of a video documenting a meticulous meal preparation by an individual with tattooed forearms¹. The caption describes the individual's actions in detail, from slicing a cucumber to mixing the dressing and adding croutons to the salad¹.

    In addition to its use in research, the ShareGPT4Video dataset has been used to train the sharegpt4video-8b model, an open-source video chatbot². This model was trained on open-source video instruction data and is primarily intended for researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence².

    (1) arXiv:2406.04325v1 [cs.CV] 6 Jun 2024. https://arxiv.org/pdf/2406.04325. (2) ShareGPT4V: Improving Large Multi-Modal Models with Better Captions. https://arxiv.org/abs/2311.12793. (3) Lin-Chen/sharegpt4video-8b · Hugging Face. https://huggingface.co/Lin-Chen/sharegpt4video-8b. (4) ShareGPT4Video: Improving Video Understanding and Generation with .... https://www.aimodels.fyi/papers/arxiv/sharegpt4video-improving-video-understanding-generation-better-captions. (5) GitHub - ShareGPT4Omni/ShareGPT4Video: An official implementation of .... https://github.com/ShareGPT4Omni/ShareGPT4Video. (6) undefined. https://sharegpt4video.github.io/.

  17. g

    Workout/Exercises Video

    • gts.ai
    • kaggle.com
    json
    Updated Jun 13, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GTS (2024). Workout/Exercises Video [Dataset]. https://gts.ai/dataset-download/workout-exercises-video/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Jun 13, 2024
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This dataset was created by myself. This dataset contains videos of people doing workouts. The name of the existing workout corresponds to the name of the folder listed.

  18. g

    WLASL (World Level American Sign Language) Video

    • gts.ai
    json
    Updated Jan 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GTS (2025). WLASL (World Level American Sign Language) Video [Dataset]. https://gts.ai/dataset-download/wlasl-world-level-american-sign-language-video/
    Explore at:
    jsonAvailable download formats
    Dataset updated
    Jan 25, 2025
    Dataset provided by
    GLOBOSE TECHNOLOGY SOLUTIONS PRIVATE LIMITED
    Authors
    GTS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    World, United States
    Description

    WLASL (Word-Level American Sign Language) dataset with 12,000 processed videos covering 2,000 ASL words. Ideal for research and machine learning in sign language recognition, licensed under C-UDA

  19. i

    Ego-SLD: A Video Dataset of Egocentric Action Recognition for Bengali Sign...

    • ieee-dataport.org
    Updated Jan 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Asfak Ali (2025). Ego-SLD: A Video Dataset of Egocentric Action Recognition for Bengali Sign Language Detection [Dataset]. https://ieee-dataport.org/documents/ego-sld-video-dataset-egocentric-action-recognition-bengali-sign-language-detection
    Explore at:
    Dataset updated
    Jan 19, 2025
    Authors
    Asfak Ali
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    postures

  20. i

    Dronevision: A Dataset of Aerial Videos for Computer Vision Applications

    • ieee-dataport.org
    Updated May 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ninad Mehendale (2023). Dronevision: A Dataset of Aerial Videos for Computer Vision Applications [Dataset]. https://ieee-dataport.org/documents/dronevision-dataset-aerial-videos-computer-vision-applications
    Explore at:
    Dataset updated
    May 11, 2023
    Authors
    Ninad Mehendale
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    datasets of aerial videos captured from drones are essential.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sharjeel M. (2023). Human Activity Recognition (HAR - Video Dataset) [Dataset]. http://doi.org/10.34740/kaggle/dsv/5722068
Organization logo

Human Activity Recognition (HAR - Video Dataset)

The dataset features 7 different classes of Human Activities in Videos.

Explore at:
3 scholarly articles cite this dataset (View in Google Scholar)
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
May 19, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sharjeel M.
License

Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically

Description

The dataset contains a comprehensive collection of human activity videos, spanning across 7 distinct classes. These classes include clapping, meeting and splitting, sitting, standing still, walking, walking while reading book, and walking while using the phone.

Each video clip in the dataset showcases a specific human activity and has been labeled with the corresponding class to facilitate supervised learning.

The primary inspiration behind creating this dataset is to enable machines to recognize and classify human activities accurately. With the advent of computer vision and deep learning techniques, it has become increasingly important to train machine learning models on large and diverse datasets to improve their accuracy and robustness.

Search
Clear search
Close search
Google apps
Main menu