4 datasets found
  1. Google Landmarks Dataset v2

    • github.com
    • opendatalab.com
    Updated Sep 27, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google (2019). Google Landmarks Dataset v2 [Dataset]. https://github.com/cvdfoundation/google-landmark
    Explore at:
    Dataset updated
    Sep 27, 2019
    Dataset provided by
    Googlehttp://google.com/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This is the second version of the Google Landmarks dataset (GLDv2), which contains images annotated with labels representing human-made and natural landmarks. The dataset can be used for landmark recognition and retrieval experiments. This version of the dataset contains approximately 5 million images, split into 3 sets of images: train, index and test. The dataset was presented in our CVPR'20 paper. In this repository, we present download links for all dataset files and relevant code for metric computation. This dataset was associated to two Kaggle challenges, on landmark recognition and landmark retrieval. Results were discussed as part of a CVPR'19 workshop. In this repository, we also provide scores for the top 10 teams in the challenges, based on the latest ground-truth version. Please visit the challenge and workshop webpages for more details on the data, tasks and technical solutions from top teams.

  2. n

    Infaunal marine invertebrate fauna inside and outside of bacterial mats,...

    • access.earthdata.nasa.gov
    • researchdata.edu.au
    • +3more
    Updated Mar 15, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Infaunal marine invertebrate fauna inside and outside of bacterial mats, Casey 2006-07 [Dataset]. http://doi.org/10.26179/5c8b147c9568b
    Explore at:
    Dataset updated
    Mar 15, 2019
    Time period covered
    Nov 10, 2006 - Dec 7, 2006
    Area covered
    Description

    Infaunal marine invertebrates were collected from inside and outside of patches of white bacterial mats from several sites in the Windmill Islands, Antarctica, around Casey station during the 2006-07 summer. Samples were collected from McGrady Cove inner and outer, the tide gauge near the Casey wharf, Stevenson's Cove and Brown Bay inner. Sediment cores of 10cm depth and 5cm diameter were collected by divers using a PVC corer from inside (4 cores) and outside (4 cores) each bacterial patch. The size of each patch varied from site to site. Cores were sieved at 500 microns and the extracted fauna preserved in 4 percent neutral buffered formalin. All fauna were counted and identified to species where possible or assigned to morphospecies based on previous infaunal sampling around Casey.

    An excel spreadsheet is available for download at the URL given below. The spreadsheet does not represent the complete dataset, and is only the bacterial mat infauna data.

    Regarding the infauna dataset:

    • in - in the mat or patch of bacteria and out is in the "normal" sediment surrounding the patch without evidence of any bacterial mat presence.
    • Patch numbers were allocated to ensure there was no confusion between patches in the same area.
    • Fauna names are our identification codes for each species. Some we have confirmed identifications for, some not. Species names, where we have them and as we get them, are listed against these codes in the Casey marine soft-sediment fauna identification guide.

    This work was completed as part of ASAC 2201 (ASAC_2201).

  3. T

    imagenet2012

    • tensorflow.org
    Updated Jun 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). imagenet2012 [Dataset]. https://www.tensorflow.org/datasets/catalog/imagenet2012
    Explore at:
    Dataset updated
    Jun 1, 2024
    Description

    ILSVRC 2012, commonly known as 'ImageNet' is an image dataset organized according to the WordNet hierarchy. Each meaningful concept in WordNet, possibly described by multiple words or word phrases, is called a "synonym set" or "synset". There are more than 100,000 synsets in WordNet, majority of them are nouns (80,000+). In ImageNet, we aim to provide on average 1000 images to illustrate each synset. Images of each concept are quality-controlled and human-annotated. In its completion, we hope ImageNet will offer tens of millions of cleanly sorted images for most of the concepts in the WordNet hierarchy.

    The test split contains 100K images but no labels because no labels have been publicly released. We provide support for the test split from 2012 with the minor patch released on October 10, 2019. In order to manually download this data, a user must perform the following operations:

    1. Download the 2012 test split available here.
    2. Download the October 10, 2019 patch. There is a Google Drive link to the patch provided on the same page.
    3. Combine the two tar-balls, manually overwriting any images in the original archive with images from the patch. According to the instructions on image-net.org, this procedure overwrites just a few images.

    The resulting tar-ball may then be processed by TFDS.

    To assess the accuracy of a model on the ImageNet test split, one must run inference on all images in the split, export those results to a text file that must be uploaded to the ImageNet evaluation server. The maintainers of the ImageNet evaluation server permits a single user to submit up to 2 submissions per week in order to prevent overfitting.

    To evaluate the accuracy on the test split, one must first create an account at image-net.org. This account must be approved by the site administrator. After the account is created, one can submit the results to the test server at https://image-net.org/challenges/LSVRC/eval_server.php The submission consists of several ASCII text files corresponding to multiple tasks. The task of interest is "Classification submission (top-5 cls error)". A sample of an exported text file looks like the following:

    771 778 794 387 650
    363 691 764 923 427
    737 369 430 531 124
    755 930 755 59 168
    

    The export format is described in full in "readme.txt" within the 2013 development kit available here: https://image-net.org/data/ILSVRC/2013/ILSVRC2013_devkit.tgz Please see the section entitled "3.3 CLS-LOC submission format". Briefly, the format of the text file is 100,000 lines corresponding to each image in the test split. Each line of integers correspond to the rank-ordered, top 5 predictions for each test image. The integers are 1-indexed corresponding to the line number in the corresponding labels file. See labels.txt.

    To use this dataset:

    import tensorflow_datasets as tfds
    
    ds = tfds.load('imagenet2012', split='train')
    for ex in ds.take(4):
     print(ex)
    

    See the guide for more informations on tensorflow_datasets.

    https://storage.googleapis.com/tfds-data/visualization/fig/imagenet2012-5.1.0.png" alt="Visualization" width="500px">

  4. n

    Annual population estimates of Southern Elephant Seals at Macquarie Island...

    • access.earthdata.nasa.gov
    • researchdata.edu.au
    • +2more
    Updated Jul 17, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Annual population estimates of Southern Elephant Seals at Macquarie Island from censuses made annually on October 15th. [Dataset]. https://access.earthdata.nasa.gov/collections/C1214311315-AU_AADC
    Explore at:
    Dataset updated
    Jul 17, 2019
    Time period covered
    Oct 15, 1985 - Present
    Area covered
    Description

    INDICATOR DEFINITION Count of all adult females, fully weaned pups and dead pups hauled out on, or close to, the day of maximum cow numbers, set for October 15.

    TYPE OF INDICATOR There are three types of indicators used in this report: 1.Describes the CONDITION of important elements of a system; 2.Show the extent of the major PRESSURES exerted on a system; 3.Determine RESPONSES to either condition or changes in the condition of a system.

    This indicator is one of: CONDITION

    RATIONALE FOR INDICATOR SELECTION Elephant seals from Macquarie Island are long distance foragers who can utilise the Southern Ocean both west as far as Heard Island and east as far as the Ross Sea. Thus their populations reflect foraging conditions across a vast area.

    The slow decline in their numbers (-2.3% annually from 1988-1993) suggests that their ocean foraging has been more difficult in recent decades. Furthermore, interactions with humans are negligible due to the absence of significant overlap in their diet with commercial fisheries. This suggests that changes in 'natural' ocean conditions may have altered aspects of prey availability. It is clear that seal numbers are changing in response to ocean conditions but at the moment these conditions cannot be specified.

    DESIGN AND STRATEGY FOR INDICATOR MONITORING PROGRAM Spatial Scale: Five beaches on Macquarie Island (lat54 degrees 37' 59.9' S, long 158 degrees 52' 59.9' E): North Head to Aurora Point; Aurora Point to Caroline Cove; Garden Cove to Sandy Bay; Sandy Bay to Waterfall Bay; Waterfall Bay to Hurd Point.

    Frequency: Annual census on 15th October

    Measurement Technique: Monitoring the Southern Elephant Seal population on Macquarie island requires a one day whole island adult female census on October 15 and a daily count of cow numbers, fully weaned pups and dead pups on the west and east isthmus beaches throughout October.

    Daily cow counts during October, along the isthmus beaches close to the Station, provide data to identify exactly the day of maximum numbers. The isthmus counts are recorded under the long-established (since 1950) harem names. Daily counts allow adjustment to the census totals if the day of maximum numbers of cows ashore happens to fall on either side of October 15. Personnel need to be dispersed around the island by October 15 so that all beaches are counted for seals on that day. This has been achieved successfully for the last 15 years.

    On the day of maximum haul out (around 15th October) the only Elephant seals present are cows, their young pups and adult males. The three classes can be readily distinguished and counted accurately. Lactating pups are not counted, their numbers are provided by the cow count on a 1:1 proportion. The combined count of cows, fully weaned pups and dead pups provides an index of pup production.

    The count of any group is made until there is agreement between counts to better than +/- 5%. Thus there is always a double count as a minimum; the number of counts can reach double figures when a large group is enumerated. The largest single group on Macquarie Island is that at West Razorback with greater than 1,000 cows; Multiple counts are always required there.

    RESEARCH ISSUES Much research has been done already to acquire demographic data so that population models can be produced. Thus there will be predicted population sizes for elephant seals on Macquarie Island in 2002 onwards and the annual censuses will allow these predictions to be tested against the actual numbers. The censuses are also a check on the population status of this endangered species.

    LINKS TO OTHER INDICATORS

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Google (2019). Google Landmarks Dataset v2 [Dataset]. https://github.com/cvdfoundation/google-landmark
Organization logo

Google Landmarks Dataset v2

Explore at:
372 scholarly articles cite this dataset (View in Google Scholar)
Dataset updated
Sep 27, 2019
Dataset provided by
Googlehttp://google.com/
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

This is the second version of the Google Landmarks dataset (GLDv2), which contains images annotated with labels representing human-made and natural landmarks. The dataset can be used for landmark recognition and retrieval experiments. This version of the dataset contains approximately 5 million images, split into 3 sets of images: train, index and test. The dataset was presented in our CVPR'20 paper. In this repository, we present download links for all dataset files and relevant code for metric computation. This dataset was associated to two Kaggle challenges, on landmark recognition and landmark retrieval. Results were discussed as part of a CVPR'19 workshop. In this repository, we also provide scores for the top 10 teams in the challenges, based on the latest ground-truth version. Please visit the challenge and workshop webpages for more details on the data, tasks and technical solutions from top teams.

Search
Clear search
Close search
Google apps
Main menu