27 datasets found

SAS code used to analyze data and a datafile with metadata glossary
catalog.data.gov
Updated Nov 12, 2020
Cite
U.S. EPA Office of Research and Development (ORD) (2020). SAS code used to analyze data and a datafile with metadata glossary [Dataset]. https://catalog.data.gov/dataset/sas-code-used-to-analyze-data-and-a-datafile-with-metadata-glossary
Explore at:
Dataset updated
Nov 12, 2020
Dataset provided by
United States Environmental Protection Agency (http://www.epa.gov/)
Description
We compiled macroinvertebrate assemblage data collected from 1995 to 2014 from the St. Louis River Area of Concern (AOC) of western Lake Superior. Our objective was to define depth-adjusted cutoff values for benthos condition classes (poor, fair, reference) to provide a tool useful for assessing progress toward achieving removal targets for the degraded benthos beneficial use impairment in the AOC. The relationship between depth and benthos metrics was wedge-shaped. We therefore used quantile regression to model the limiting effect of depth on selected benthos metrics, including taxa richness, percent non-oligochaete individuals, combined percent Ephemeroptera, Trichoptera, and Odonata individuals, and density of ephemerid mayfly nymphs (Hexagenia). We created a scaled trimetric index from the first three metrics. Metric values at or above the 90th percentile quantile regression model prediction were defined as reference condition for that depth. We set the cutoff between poor and fair condition as the 50th percentile model prediction. We examined sampler type, exposure, geographic zone of the AOC, and substrate type for confounding effects. Based on these analyses we combined data across sampler type and exposure classes and created separate models for each geographic zone. We used the resulting condition class cutoff values to assess the relative benthic condition for three habitat restoration project areas. The depth-limited pattern of ephemerid abundance we observed in the St. Louis River AOC also occurred elsewhere in the Great Lakes. We provide tabulated model predictions for application of our depth-adjusted condition class cutoff values to new sample data.
This dataset is associated with the following publication: Angradi, T., W. Bartsch, A. Trebitz, V. Brady, and J. Launspach. A depth-adjusted ambient distribution approach for setting numeric removal targets for a Great Lakes Area of Concern beneficial use impairment: Degraded benthos. JOURNAL OF GREAT LAKES RESEARCH. International Association for Great Lakes Research, Ann Arbor, MI, USA, 43(1): 108-120, (2017).
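The condition-class rule in the description above can be illustrated with a minimal Python sketch. It assumes the 50th and 90th percentile model predictions for the sample's depth have already been looked up from the tabulated predictions; the function name, argument names, and example numbers are hypothetical, and treating a value exactly at the 50th percentile prediction as "fair" is an assumption, since the description only states where the cutoff lies.

```python
def condition_class(metric_value, q50_prediction, q90_prediction):
    """Classify one benthos metric value against depth-specific cutoffs.

    q50_prediction and q90_prediction are the 50th and 90th percentile
    quantile regression predictions for the depth of the sample.
    """
    if q50_prediction > q90_prediction:
        raise ValueError("the 50th percentile cutoff cannot exceed the 90th")
    # At or above the 90th percentile prediction: reference condition.
    if metric_value >= q90_prediction:
        return "reference"
    # Assumption: a value exactly at the 50th percentile cutoff counts as fair.
    if metric_value >= q50_prediction:
        return "fair"
    return "poor"


# Hypothetical taxa richness values checked against hypothetical cutoffs.
for value in (5, 9, 12):
    print(value, condition_class(value, q50_prediction=8, q90_prediction=11))
```

The cutoffs differ by depth and by geographic zone, so a real application would first select the zone-specific prediction for the sample's depth and only then apply the comparison.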
Editing EU-SILC UDB Longitudinal Data for Differential Mortality Analyses....
b2find.eudat.eu
Updated Jun 28, 2024
Cite
(2024). Editing EU-SILC UDB Longitudinal Data for Differential Mortality Analyses. SAS code and documentation. - Dataset - B2FIND [Dataset]. https://b2find.eudat.eu/dataset/b46561d2-3e25-5315-b4e4-4be8f4bd05ac
Explore at:
Dataset updated
Jun 28, 2024
Description
This SAS code extracts data from EU-SILC User Database (UDB) longitudinal files and edits it such that a file is produced that can be further used for differential mortality analyses. Information from the original D, R, H and P files is merged per person and possibly pooled over several longitudinal data releases. Vital status information is extracted from target variables DB110 and RB110, and time at risk between the first interview and either death or censoring is estimated based on quarterly date information. Apart from path specifications, the SAS code consists of several SAS macros. Two of them require parameter specification from the user. The other ones are just executed. The code was written in Base SAS, Version 9.4. By default, the output file contains several variables which are necessary for differential mortality analyses, such as sex, age, country, year of first interview, and vital status information. In addition, the user may specify the analytical variables by which mortality risk should be compared later, for example educational level or occupational class. These analytical variables may be measured either at the first interview (the baseline) or at the last interview of a respondent. The output file is available in SAS format and by default also in csv format.
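As a rough illustration of estimating time at risk from quarterly dates, the Python sketch below places each event at the midpoint of its quarter and takes the difference in years. The midpoint rule, the function names, and the example dates are assumptions for illustration only; the description does not state which rule the SAS macros actually apply.

```python
def quarter_midpoint(year, quarter):
    """Return a decimal year for the middle of the given quarter (1 to 4)."""
    if quarter not in (1, 2, 3, 4):
        raise ValueError("quarter must be 1, 2, 3 or 4")
    return year + (quarter - 0.5) / 4.0


def time_at_risk_years(first_year, first_quarter, end_year, end_quarter):
    """Years between the first interview and death or censoring.

    Assumption: both events are placed at the midpoint of their quarter.
    """
    start = quarter_midpoint(first_year, first_quarter)
    end = quarter_midpoint(end_year, end_quarter)
    if end < start:
        raise ValueError("end of follow-up precedes the first interview")
    return end - start


# Hypothetical respondent: first interview in Q1 2010, censored in Q3 2012.
print(time_at_risk_years(2010, 1, 2012, 3))
```

With quarterly information only, any such estimate is accurate to within roughly a quarter of a year at each end of follow-up.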
SAS: Semantic Artist Similarity Dataset
live.european-language-grid.eu
zenodo.org
txt
Updated Oct 28, 2023
Cite
(2023). SAS: Semantic Artist Similarity Dataset [Dataset]. https://live.european-language-grid.eu/catalogue/corpus/7418
Explore at:
Available download formats: txt
Dataset updated
Oct 28, 2023
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The Semantic Artist Similarity dataset consists of two datasets of artist entities with their corresponding biography texts, and the list of top-10 most similar artists within the datasets used as ground truth. The dataset is composed of a corpus of 268 artists and a slightly larger one of 2,336 artists, both gathered from Last.fm in March 2015. The former is mapped to the MIREX Audio and Music Similarity evaluation dataset, so that its similarity judgments can be used as ground truth. For the latter corpus we use the similarity between artists as provided by the Last.fm API. For every artist there is a list with the top-10 most related artists. In the MIREX dataset there are 188 artists with at least 10 similar artists; the other 80 artists have fewer than 10 similar artists. In the Last.fm API dataset all artists have a list of 10 similar artists.

There are 4 files in the dataset. mirex_gold_top10.txt and lastfmapi_gold_top10.txt have the top-10 lists of artists for every artist of both datasets. Artists are identified by MusicBrainz ID. The format of the file is one line per artist, with the artist mbid separated by a tab from the list of top-10 related artists, which are identified by their mbid and separated by spaces:
artist_mbid \t artist_mbid_top10_list_separated_by_spaces
mb2uri_mirex and mb2uri_lastfmapi.txt have the list of artists. In each line there are three fields separated by tabs. The first field is the MusicBrainz ID, the second field is the Last.fm name of the artist, and the third field is the DBpedia URI:
artist_mbid \t lastfm_name \t dbpedia_uri
There are also 2 folders in the dataset with the biography texts of each dataset. Each .txt file in the biography folders is named with the MusicBrainz ID of the biographied artist. Biographies were gathered from the Last.fm wiki page of every artist.

Using this dataset

We would highly appreciate it if scientific publications of works partly based on the Semantic Artist Similarity dataset cite the following publication: Oramas, S., Sordo M., Espinosa-Anke L., & Serra X. (In Press). A Semantic-based Approach for Artist Similarity. 16th International Society for Music Information Retrieval Conference.

We are interested in knowing if you find our datasets useful! If you use our dataset please email us at mtg-info@upf.edu and tell us about your research. https://www.upf.edu/web/mtg/semantic-similarity
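The two line formats described for this dataset (a tab between the artist and its list, spaces inside the list; three tab-separated fields in the mapping files) could be parsed along the following lines. This is a minimal Python sketch: the function names and the placeholder identifiers in the example are hypothetical, and real files would be read line by line from the .txt files named in the description.

```python
def parse_top10_line(line):
    """Split one gold-standard line into (artist_mbid, [related mbids])."""
    artist_mbid, _, related = line.rstrip("\n").partition("\t")
    # The related artists are separated by spaces; an artist may have
    # fewer than 10 entries in the MIREX-based file.
    return artist_mbid, related.split()


def parse_mapping_line(line):
    """Split one mapping line into (artist_mbid, lastfm_name, dbpedia_uri)."""
    artist_mbid, lastfm_name, dbpedia_uri = line.rstrip("\n").split("\t")
    return artist_mbid, lastfm_name, dbpedia_uri


# Placeholder identifiers, not real MusicBrainz IDs.
artist, related = parse_top10_line("mbid-a\tmbid-b mbid-c mbid-d\n")
print(artist, related)
print(parse_mapping_line("mbid-a\tSome Artist\thttp://dbpedia.org/resource/Some_Artist\n"))
```

Using `partition` for the first format keeps the parser tolerant of an artist whose related list is empty, whereas the mapping parser deliberately fails if a line does not have exactly three fields.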
DHS data extractors for Stata
search.dataone.org
dataverse.harvard.edu
Updated Nov 21, 2023
Cite
Emily Oster (2023). DHS data extractors for Stata [Dataset]. http://doi.org/10.7910/DVN/RRX3QD
Explore at:
Unique identifier
https://doi.org/10.7910/DVN/RRX3QD
Dataset updated
Nov 21, 2023
Dataset provided by
Harvard Dataverse
Authors
Emily Oster
Description
This package contains two files designed to help read individual-level DHS data into Stata. The first file addresses the problem that versions of Stata before Version 7/SE will read in only up to 2047 variables, while most of the individual files have more variables than that. The file will read in the .do, .dct and .dat files and output new .do and .dct files with only a subset of the variables specified by the user. The second file deals with earlier DHS surveys in which .do and .dct files do not exist and only .sps and .sas files are provided. The file will read in the .sas and .sps files and output a .dct and .do file. If necessary, the first file can then be run again to select a subset of variables.
The coefficients ai, bi, ci, di, ei, fi of the affine transforms wn (first...
plos.figshare.com
xls
Updated Jun 1, 2023
Cite
Eugen Mircea Anitas; Azat Slyamov (2023). The coefficients ai, bi, ci, di, ei, fi of the affine transforms wn (first column) for deterministic algorithm in Eq (10), and the probabilities pn (last column) for random iteration algorithm in Eq (9). [Dataset]. http://doi.org/10.1371/journal.pone.0181385.t001
Explore at:
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0181385.t001
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS ONE
Authors
Eugen Mircea Anitas; Azat Slyamov
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
The coefficients ai, bi, ci, di, ei, fi of the affine transforms wn (first column) for deterministic algorithm in Eq (10), and the probabilities pn (last column) for random iteration algorithm in Eq (9).
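For readers unfamiliar with the random iteration algorithm mentioned in this table caption, the Python sketch below shows the general idea: at each step one affine transform is drawn with its probability and applied to the current point. Two things are assumptions here. First, the coefficient convention x' = a*x + b*y + e, y' = c*x + d*y + f is the common one for iterated function systems and may differ from the paper's Eq (9). Second, the coefficients used are those of a Sierpinski triangle chosen purely for illustration; they are not the values from the table.

```python
import random


def random_iteration(transforms, probabilities, n_points, seed=0):
    """Generate points by repeatedly applying randomly chosen affine maps.

    Each transform is a tuple (a, b, c, d, e, f), assumed to act as
    x' = a*x + b*y + e and y' = c*x + d*y + f.
    """
    rng = random.Random(seed)
    x, y = 0.0, 0.0
    points = []
    for _ in range(n_points):
        # Draw one transform according to its probability p_n.
        a, b, c, d, e, f = rng.choices(transforms, weights=probabilities, k=1)[0]
        x, y = a * x + b * y + e, c * x + d * y + f
        points.append((x, y))
    return points


# Illustrative coefficients (Sierpinski triangle), NOT the table's values.
SIERPINSKI = [
    (0.5, 0.0, 0.0, 0.5, 0.0, 0.0),
    (0.5, 0.0, 0.0, 0.5, 0.5, 0.0),
    (0.5, 0.0, 0.0, 0.5, 0.25, 0.5),
]
points = random_iteration(SIERPINSKI, [1 / 3, 1 / 3, 1 / 3], 1000)
print(len(points), points[-1])
```

Substituting the table's coefficients and probabilities for the illustrative ones would only require replacing the `SIERPINSKI` list and the weights, provided the coefficient convention matches.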
Situation Assessment Survey, 2003 - India
microdata.fao.org
Updated Jul 22, 2020
Cite
National Sample Survey Organization (2020). Situation Assessment Survey, 2003 - India [Dataset]. https://microdata.fao.org/index.php/catalog/1277
Explore at:
Dataset updated
Jul 22, 2020
Dataset provided by
National Sample Survey Organisation
Authors
National Sample Survey Organization
Time period covered
2003
Area covered
India
Description
Abstract

Millions of farmers in India have made significant contributions in providing food and nutrition to the entire nation, while also providing livelihoods to millions of people in the country. During the past five decades of planned economic development, India has moved from food shortage and imports to self-sufficiency and exports. Food security and the well-being of the farmer appear to be major areas of concern of the planners and policy makers of Indian agriculture. In order to have a comprehensive picture of the farming community at the commencement of the third millennium, and to analyze the impact of the transformation induced by public policy, investments and technological change on the farmers' access to resources and income, as well as their well-being, the Ministry of Agriculture decided to collect information on Indian farmers through a Situation Assessment Survey (SAS) and entrusted the job of conducting the survey to the National Sample Survey Organisation (NSSO).

The SAS 2003 is the first of its kind to be conducted by NSSO. Though information on a majority of the items to be collected through SAS has been collected in some round or other of NSS, an integrated schedule, Schedule 33, covering some basic characteristics of farming households and their access to basic and modern farming resources, was canvassed for the first time in SAS. Moreover, information on consumption of various goods and services in an abridged form was also collected to have an idea about the pattern of consumption expenditure of the farming households.

Schedule 33 was designed for collecting information on aspects relating to farming and other socio-economic characteristics of farming households. The information was collected in two visits to the same set of sample households. The first visit was made during January to August 2003 and the second, during September to December 2003. The survey was conducted in rural areas only. It was canvassed in the Central Sample except for the States of Maharashtra and Meghalaya where it was canvassed in both State and Central samples.

Geographic coverage

National Coverage

Analysis unit

Households

Kind of data

Sample survey data [ssd]

Sampling procedure

A stratified multi-stage sampling design was adopted for the SAS 2003, 59th round. The First Stage Unit (FSU), also known as the primary sampling unit, was the census village in the rural sector and UFS block in the urban sector. The Ultimate Stage Units (USUs) were households in both sectors. Hamlet-group / sub-block constitute the intermediate stage, if these are formed in the selected area.

The list of villages (panchayat wards for Kerala) based on the Population Census of 1991 constituted the sampling frame for FSUs in rural areas, while the latest UFS frame was the sampling frame used for urban areas. For stratification of towns by size class, provisional population of towns as per Census 2001 was used. A detailed description of the sampling strategy can be found in the estimation procedure document attached in the documentation/external resource.

Mode of data collection

Face-to-face paper [f2f]
Spoofing and Anti-Spoofing (SAS) corpus v1.0
dtechtive.com
find.data.gov.scot
gz, pdf, txt
Updated May 27, 2015
Cite
University of Edinburgh. The Centre for Speech Technology Research (CSTR) (2015). Spoofing and Anti-Spoofing (SAS) corpus v1.0 [Dataset]. http://doi.org/10.7488/ds/252
Explore at:
Available download formats: txt(0.001 MB), gz(7773.184 MB), txt(0.0166 MB), gz(3306.496 MB), gz(10065.92 MB), gz(7763.968 MB), gz(10280.96 MB), gz(7478.272 MB), gz(6644.736 MB), gz(7974.912 MB), gz(6674.432 MB), gz(9846.784 MB), pdf(0.1048 MB), gz(9935.872 MB), gz(10393.6 MB), gz(7985.152 MB), gz(10240 MB)
Unique identifier
https://doi.org/10.7488/ds/252
Dataset updated
May 27, 2015
Dataset provided by
University of Edinburgh. The Centre for Speech Technology Research (CSTR)
License
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This dataset is associated with the paper 'SAS: A speaker verification spoofing database containing diverse attacks', which 'presents the first version of a speaker verification spoofing and anti-spoofing database, named SAS corpus. The corpus includes nine spoofing techniques, two of which are speech synthesis, and seven are voice conversion. We design two protocols, one for standard speaker verification evaluation, and the other for producing spoofing materials. Hence, they allow the speech synthesis community to produce spoofing materials incrementally without knowledge of speaker verification spoofing and anti-spoofing. To provide a set of preliminary results, we conducted speaker verification experiments using two state-of-the-art systems. Without any anti-spoofing techniques, the two systems are extremely vulnerable to the spoofing attacks implemented in our SAS corpus'. N.B. the files in the following fileset should also be taken as part of the same dataset as those provided here: Wu et al. (2017). Key files for Spoofing and Anti-Spoofing (SAS) corpus v1.0, [dataset]. University of Edinburgh. The Centre for Speech Technology Research (CSTR). http://hdl.handle.net/10283/2741
Synthetic Aperture Sonar Survey to Locate Archaeological Resources in the...
catalog.data.gov
Updated Oct 19, 2024
Cite
(Point of Contact) (2024). Synthetic Aperture Sonar Survey to Locate Archaeological Resources in the Stellwagen Bank National Marine Sanctuary on NOAA Office of National Marine Sanctuaries vessel SRVx between 20100823 and 20100901 [Dataset]. https://catalog.data.gov/dataset/synthetic-aperture-sonar-survey-to-locatearchaeological-resources-in-the-stellwagen-bank-nation2
Explore at:
Dataset updated
Oct 19, 2024
Dataset provided by
(Point of Contact)
Area covered
Gerry E. Studds/Stellwagen Bank National Marine Sanctuary
Description
SAS technology exemplifies recent advances in geophysical survey technology that will revolutionize maritime archaeological remote sensing. Applied Signal Technology (AST) has combined their SAS with the MacArtney FOCUS-2 ROTV to create the ultimate towed acoustic imaging device, PROSAS Surveyor. Capable of an area coverage rate of 2.5 kilometer/hour with a resolution of 3 centimeters, PROSAS Surveyor will greatly expand capabilities to locate even the oldest archaeological sites on the continental shelf, particularly where sedimentation is limited. Large area seafloor mapping at a resolution capable of imaging very small targets is a tremendously expensive proposition for submerged land managers responsible for bottom lands from 30 to 300 meters in depth. Daily operating costs for a suitable research vessel and personnel limit the area that can be investigated. This project will for the first time apply commercially available SAS technology to the search for historic shipwrecks. The rapidity and resolution of this project's survey will be as much as a four-fold increase in area covered as compared to conventional marine archaeological remote sensing survey.
SAS-Bench
huggingface.co
Updated Jun 17, 2025
Cite
Peichao Lai (2025). SAS-Bench [Dataset]. https://huggingface.co/datasets/aleversn/SAS-Bench
Explore at:
Dataset updated
Jun 17, 2025
Authors
Peichao Lai
License
Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models


🔍 Overview

SAS-Bench represents the first specialized benchmark for evaluating Large Language Models (LLMs) on Short Answer Scoring (SAS) tasks. Utilizing authentic questions from China's National College Entrance Examination (Gaokao)… See the full description on the dataset page: https://huggingface.co/datasets/aleversn/SAS-Bench.
Data from: Dual kinetic and structural role for the surface in guiding SAS-6...
heidata.uni-heidelberg.de
Updated Nov 4, 2021
Cite
Svenja de Buhr; Frauke Gräter (2021). Dual kinetic and structural role for the surface in guiding SAS-6 self-assembly to direct centriole architecture [Data] [Dataset]. http://doi.org/10.11588/DATA/3NKHAY
Explore at:
Available download formats: zip(6801360), zip(4924793839), zip(1963), text/plain; charset=us-ascii(1737), zip(13209834), txt(4078), zip(1752945)
Unique identifier
https://doi.org/10.11588/DATA/3NKHAY
Dataset updated
Nov 4, 2021
Dataset provided by
heiDATA
Authors
Svenja de Buhr; Frauke Gräter
License
https://heidata.uni-heidelberg.de/api/datasets/:persistentId/versions/1.1/customlicense?persistentId=doi:10.11588/DATA/3NKHAY
Dataset funded by
Carl Zeiss Foundation
bwHPC
Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)
Klaus Tschira Foundation
Description
This dataset contains input structures and parameters for coarse-grained molecular dynamics simulation of SAS-6 protein oligomers as well as post-processing files and analysis scripts. Abstract of related publication: Discovering mechanisms governing organelle assembly is a fundamental pursuit in the life sciences. The centriole is an evolutionarily conserved organelle with a signature 9-fold symmetrical chiral arrangement of microtubules imparted onto the cilium it templates. The first structure in nascent centrioles is a cartwheel, which comprises stacked 9-fold symmetrical SAS-6 ring polymers emerging orthogonal to a surface surrounding resident centrioles. The mechanisms through which SAS-6 polymerization ensures centriole organelle architecture remain elusive. We deployed photothermally-actuated off-resonance tapping high-speed atomic force microscopy (PORT-HS-AFM) to decipher surface SAS-6 self-assembly mechanisms. We discovered that the surface shifts the reaction equilibrium by ~10^4 compared to solution. Moreover, coarse-grained molecular dynamics simulations and PORT-HS-AFM revealed that the surface converts the inherently helical propensity of SAS-6 polymers into 9-fold rings with residual asymmetry, which may guide ring stacking and impart chiral features to centrioles and cilia. Overall, our work reveals fundamental design principles governing centriole assembly.
Situation Assessment Survey of Agricultural Households, January - December...
microdata.gov.in
Updated Mar 27, 2019
Cite
National Sample Survey Organization (2019). Situation Assessment Survey of Agricultural Households, January - December 2013 - India [Dataset]. https://microdata.gov.in/NADA/index.php/catalog/133
Explore at:
Dataset updated
Mar 27, 2019
Dataset provided by
National Sample Survey Organisation
Authors
National Sample Survey Organization
Time period covered
2013
Area covered
India
Description
Abstract

In order to have a comprehensive picture of the farming community and to analyze the impact of the transformation induced by public policy, investments and technological change on the farmers' access to resources and income as well as well-being of the farmer households it was decided to collect information on Indian farmers through “Situation Assessment Survey” (SAS). The areas of interest for conducting SAS would include economic well-being of farmer households as measured by consumer expenditure, income and productive assets, and indebtedness; their farming practices and preferences, resource availability, and their awareness of technological developments and access to modern technology in the field of agriculture. In this survey, detailed information would be collected on receipts and expenses of households' farm and non-farm businesses, to arrive at their income from these sources. Income from other sources would also be ascertained, and so would be the consumption expenditure of the households.

Geographic coverage

National, State, Rural, Urban

Analysis unit

Households

Universe

All households of the following types: 1 - self-employed in agriculture; 2 - self-employed in non-agriculture; 3 - regular wage/salary earning; 4 - casual labour in agriculture; 5 - casual labour in non-agriculture; 6 - others

Kind of data

Sample survey data [ssd]

Sampling procedure

Total sample size (FSUs): 8042 FSUs have been allocated for the central sample at all-India level. For the state sample, there are 8998 FSUs allocated for all-India.

Sample design: A stratified multi-stage design has been adopted for the 70th round survey. The first stage units (FSU) are the census villages (Panchayat wards in case of Kerala) in the rural sector and Urban Frame Survey (UFS) blocks in the urban sector. The ultimate stage units (USU) are households in both the sectors. In case of large FSUs, one intermediate stage of sampling is the selection of two hamlet-groups (hgs)/ sub-blocks (sbs) from each rural/ urban FSU.

Sampling Frame for First Stage Units: For the rural sector, the list of 2001 census villages updated by excluding the villages urbanised and including the towns de-urbanised after 2001 census (henceforth the term 'village' would mean Panchayat wards for Kerala) constitutes the sampling frame. For the urban sector, the latest updated list of UFS blocks (2007-12) is considered as the sampling frame.

Stratification:

(a) Strata have been formed at district level. Within each district of a State/ UT, generally speaking, two basic strata have been formed: (i) a rural stratum comprising all rural areas of the district and (ii) an urban stratum comprising all the urban areas of the district. However, within the urban areas of a district, if there were one or more towns with population 10 lakhs or more as per population census 2011, each of them formed a separate basic stratum and the remaining urban areas of the district were considered as another basic stratum.

(b) However, a special stratum in the rural sector only was formed at State/UT level before district- strata were formed in case of each of the following 20 States/UTs: Andaman & Nicobar Islands, Andhra Pradesh, Assam, Bihar, Chhattisgarh, Delhi, Goa, Gujarat, Haryana, Jharkhand, Karnataka, Lakshadweep, Madhya Pradesh, Maharashtra, Odisha, Punjab, Rajasthan, Tamil Nadu, Uttar Pradesh and West Bengal. This stratum will comprise all the villages of the State with population less than 50 as per census 2001.

(c) In case of rural sectors of Nagaland one special stratum has been formed within the State consisting of all the interior and inaccessible villages. Similarly, for Andaman & Nicobar Islands, one more special stratum has been formed within the UT consisting of all inaccessible villages. Thus for Andaman & Nicobar Islands, two special strata have been formed at the UT level:

(i) special stratum 1 comprising all the interior and inaccessible villages (ii) special stratum 2 containing all the villages, other than those in special stratum 1, having population less than 50 as per census 2001.

Sub-stratification:

Rural sector: Different sub-stratifications are done for 'hilly' States and other States. Ten (10) States are considered as hilly States. They are: Jammu & Kashmir, Himachal Pradesh, Uttarakhand, Sikkim, Meghalaya, Tripura, Mizoram, Manipur, Nagaland and Arunachal Pradesh.

(a) sub-stratification for hilly States: If 'r' be the sample size allocated for a rural stratum, the number of sub-strata formed was 'r/2'. The villages within a district as per frame have been first arranged in ascending order of population. Then sub-strata 1 to 'r/2' have been demarcated in such a way that each sub-stratum comprised a group of villages of the arranged frame and have more or less equal population.

(b) sub-stratification for other States (non-hilly States except Kerala): The villages within a district as per frame were first arranged in ascending order of proportion of irrigated area in the cultivated area of the village. Then sub-strata 1 to 'r/2' have been demarcated in such a way that each sub-stratum comprised a group of villages of the arranged frame and have more or less equal cultivated area. The information on irrigated area and cultivated area was obtained from the village directory of census 2001.

(c) sub-stratification for Kerala: Although Kerala is a non-hilly State, sub-stratification by proportion of irrigated area was not possible because information on irrigation is not available at FSU (Panchayat ward) level. Hence the procedure for sub-stratification in Kerala was the same as that for hilly States.
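The rural sub-stratification described in (a) can be sketched in Python as follows: villages are sorted in ascending order of population and then cut into r/2 consecutive groups of roughly equal total population. The survey documentation does not state the exact demarcation rule, so assigning each village by the midpoint of its cumulative-population interval is an assumption, as are the function name and the example villages.

```python
def form_substrata(villages, r):
    """Group villages into r/2 sub-strata of roughly equal total size.

    villages: list of (name, population) pairs; r: allocated sample size.
    """
    n_sub = r // 2
    if n_sub < 1:
        raise ValueError("r must be at least 2")
    # Arrange the frame in ascending order of population.
    ordered = sorted(villages, key=lambda village: village[1])
    total = sum(population for _, population in ordered)
    if total <= 0:
        raise ValueError("total population must be positive")
    substrata = [[] for _ in range(n_sub)]
    cumulative = 0
    for name, population in ordered:
        # Assumed rule: place the village by the midpoint of its
        # cumulative-population interval.
        midpoint = cumulative + population / 2
        index = min(int(midpoint * n_sub / total), n_sub - 1)
        substrata[index].append(name)
        cumulative += population
    return substrata


# Hypothetical frame of six villages with r = 4, giving two sub-strata.
frame = [("v1", 100), ("v2", 200), ("v3", 300),
         ("v4", 400), ("v5", 500), ("v6", 500)]
print(form_substrata(frame, 4))
```

For the non-hilly States in (b), the same grouping idea would apply with villages ordered by proportion of irrigated area and group sizes balanced on cultivated area instead of population.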

Urban sector: There was no sub-stratification for the strata of million plus cities. For other strata, each district was divided into 2 sub-strata as follows:

sub-stratum 1: all towns of the district with population less than 50000 as per census 2011
sub-stratum 2: remaining non-million plus towns of the district

Allocation of total sample to States and UTs: The total number of sample FSUs has been allocated to the States and UTs in proportion to population as per census 2011, subject to a minimum sample allocation to each State/ UT.

Allocation to strata: Within each sector of a State/ UT, the respective sample size has been allocated to the different strata in proportion to the population as per census 2011. Allocations at stratum level are adjusted to multiples of 2 with a minimum sample size of 2.

Allocation to sub-strata:

1 Rural: Allocation is 2 for each sub-stratum in rural.

2 Urban: Stratum allocations have been distributed among the two sub-strata in proportion to the number of FSUs in the sub-strata. Minimum allocation for each sub-stratum is 2.

Selection of FSUs: For the rural sector, from each stratum x sub-stratum, the required number of sample villages has been selected by Simple Random Sampling Without Replacement (SRSWOR). For the urban sector, FSUs have been selected by using Simple Random Sampling Without Replacement (SRSWOR) from each stratum x sub-stratum. Both rural and urban samples were drawn in the form of two independent sub-samples, and an equal number of samples has been allocated among the two sub-rounds.
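The allocation and selection steps above can be illustrated with a short Python sketch: stratum allocations proportional to population, adjusted to multiples of 2 with a minimum of 2, followed by simple random sampling without replacement. The rounding rule (nearest multiple of 2) is an assumption because the documentation does not say how the adjustment is made, and the stratum names, populations, and village identifiers are hypothetical.

```python
import random


def allocate_to_strata(total_sample, populations):
    """Allocate a sample to strata in proportion to population.

    Assumption: each allocation is rounded to the nearest multiple of 2,
    with a minimum of 2 per stratum.
    """
    total_population = sum(populations.values())
    allocation = {}
    for stratum, population in populations.items():
        raw = total_sample * population / total_population
        allocation[stratum] = max(2, 2 * round(raw / 2))
    return allocation


def select_fsus(frame, n, seed=None):
    """Select n first stage units by SRSWOR from the given frame."""
    rng = random.Random(seed)
    # random.sample draws without replacement.
    return rng.sample(frame, n)


# Hypothetical strata and frame.
allocation = allocate_to_strata(20, {"rural": 7000, "urban": 2600, "metro": 400})
print(allocation)
villages = ["village_%d" % i for i in range(1, 11)]
print(select_fsus(villages, 2, seed=1))
```

Because of the minimum and the rounding, the allocations in this sketch sum to 22 rather than 20; how such differences are reconciled in the actual survey is not described here.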

For details, refer to the external resource "Note on Sample Design and Estimation Procedure of NSS 70th Round", page 2.

Sampling deviation

There was no deviation from the original sampling design.

Mode of data collection

Face-to-face [f2f]

Research instrument

There are 17 blocks in visit 1. Visits 1 & 2: Each sample FSU will be visited twice during this round. Since the workload of the first visit (i.e. visit 1) will be greater, the first visit will continue till the end of July 2013. Thus, the period of the first visit will be January - July 2013 and that of the second visit (i.e. visit 2) will be August - December 2013.
Geometric and hydrodynamic parameters for each cervical SAS model.
datasetcatalog.nlm.nih.gov
plos.figshare.com
Updated Oct 10, 2013
Cite
Fischer, Paul; Martin, Bryn A.; Luciano, Mark; Kalata, Wojciech; Shaffer, Nicholas; Loth, Francis (2013). Geometric and hydrodynamic parameters for each cervical SAS model. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001687276
Explore at:
Dataset updated
Oct 10, 2013
Authors
Fischer, Paul; Martin, Bryn A.; Luciano, Mark; Kalata, Wojciech; Shaffer, Nicholas; Loth, Francis
Description
Note: , , and are mean values (Mean ± SD) calculated for the first 2.5 cm of the model length. Max was also calculated for the first 2.5 cm of the model length.
Data and Materials for: Preparedness Increases Confidence in Any Accessible...
data.mendeley.com
Updated Feb 25, 2020
Cite
Patrick J. Carroll (2020). Data and Materials for: Preparedness Increases Confidence in Any Accessible Thoughts Affecting Evaluation Unrelated to the Original Domain of Preparation [Dataset]. http://doi.org/10.17632/sxxmzcp66h.1
Explore at:
Unique identifier
https://doi.org/10.17632/sxxmzcp66h.1
Dataset updated
Feb 25, 2020
Authors
Patrick J. Carroll
License
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
Supplementary Materials and SAS data files for Studies 1-3. The first half of the SAS files include the Hayes Process Macro. The syntax and program commands for the specific data set can be found at the end of the file.
Replication Data for: WHICH PANEL DATA ESTIMATOR SHOULD I USE?: A...
dataverse.harvard.edu
search.dataone.org
Updated Nov 16, 2017
Cite
Mantobaye Moundigbaye; William S. Rea; Robert rt Reed (2017). Replication Data for: WHICH PANEL DATA ESTIMATOR SHOULD I USE?: A CORRIGENDUM AND EXTENSION [Dataset]. http://doi.org/10.7910/DVN/YKSATT
Explore at:
Croissant (a format for machine-learning datasets; see mlcommons.org/croissant)
Unique identifier
https://doi.org/10.7910/DVN/YKSATT
Dataset updated
Nov 16, 2017
Dataset provided by
Harvard Dataverse
Authors
Mantobaye Moundigbaye; William S. Rea; Robert rt Reed
License
CC0 1.0 Universal Public Domain Dedication: https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Description
This dataset contains all the materials needed to reproduce the results in "Which Panel Data Estimator Should I Use?: A Corrigendum and Extension". Please read the README document first. The results were obtained using SAS/IML software, and the files consist of SAS data sets and SAS programs.
Season Agriculture Survey 2019 - Rwanda
datacatalog.ihsn.org
catalog.ihsn.org
Updated Aug 2, 2023
National Institute of Statistics of Rwanda (2023). Season Agriculture Survey 2019 - Rwanda [Dataset]. https://datacatalog.ihsn.org/catalog/11419
Dataset updated
Aug 2, 2023
Dataset authored and provided by
National Institute of Statistics of Rwanda
Time period covered
2018 - 2019
Area covered
Rwanda
Description
Abstract

The main objective of the Seasonal Agricultural Survey is to provide timely, accurate, reliable and comprehensive agricultural statistics that describe the structure of agriculture in Rwanda mainly in terms of land use, crop area, yield and crop production to monitor current agricultural and food supply conditions and to facilitate evidence-based decision making for the development of the agricultural sector.

In this regard, the National Institute of Statistics of Rwanda conducted the Seasonal Agriculture Survey (SAS) from September 2018 to August 2019 to gather up-to-date information for monitoring progress on agriculture programs and policies. The 2019 SAS covered the two main agricultural seasons, Season A (September to February of the following year) and Season B (March to June), as well as Season C, the small agricultural season mainly for vegetables and sweet potato grown in swamps and Irish potato grown in the volcanic agro-ecological zone. The survey provides data on farm characteristics (area, yield and production), agricultural practices, agricultural inputs and use of crop production.

Geographic coverage

National coverage allowing district-level estimation of key indicators

Analysis unit

This seasonal agriculture survey focused on the following units of analysis: Small scale agricultural farms and large scale farms

Universe

The SAS 2019 targeted potential agricultural land and large scale farmers

Kind of data

Sample survey data [ssd]

Sampling procedure

Out of 10 strata, only 4 are considered to represent the country's land potential for agriculture, and they cover a total area of 1,787,571.2 hectares (ha). Those strata are: 1.0 (tea plantations), 1.1 (intensive agriculture land on hillsides), 2.0 (intensive agriculture land in marshlands) and 3.0 (rangelands). The remaining land use strata represent all the non-agricultural land in Rwanda. Stratum 1.0, which represents tea plantations, is assumed to be well monitored through administrative records by the National Agriculture Export Board (NAEB), an institution whose main mission is to promote agricultural export commodities. Thus, SAS is conducted on 3 strata (1.1, 2.0 and 3.0) to cover other major crops. Within each district, the agriculture strata (1.1, 2.0 and 3.0) were divided into larger sampling units called first-step or primary sampling units (PSUs) (as shown in Figure 2). Strata 1.1 and 2.0 were divided into PSUs of around 100 ha, while stratum 3.0 was divided into PSUs of around 500 ha. After sample size determination, a sample of PSUs was drawn by systematic sampling with probability proportional to size, and a given number of PSUs to be selected for each stratum was assigned in every district. The 2018 SAS sample of 780 segments was kept the same for SAS 2019 in Seasons A and B.

At the first stage, 780 PSUs sampled countrywide were proportionally allocated across the levels of stratification (hillside, marshland and rangeland strata) for the 30 districts of Rwanda, to allow publication of results at district level. Sampled PSUs in each stratum were systematically selected from the frame with probability of selection proportional to the size of the PSU.
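As an illustration only, systematic selection with probability proportional to size can be sketched as follows. The function name `systematic_pps`, its parameters, and the example sizes are hypothetical and are not taken from the survey documentation; the actual frame and software used by NISR may differ.

```python
import random
from itertools import accumulate

def systematic_pps(sizes, n, rng=None):
    """Pick n unit indices by systematic sampling with probability
    proportional to size (a unit larger than the interval can repeat)."""
    rng = rng or random.Random()
    step = sum(sizes) / n              # sampling interval along the cumulated sizes
    start = rng.random() * step        # random start in [0, step)
    cum = list(accumulate(sizes))      # cumulative size totals
    picks, i = [], 0
    for k in range(n):
        point = start + k * step
        # Advance to the unit whose cumulative range contains the point
        while i < len(cum) - 1 and cum[i] <= point:
            i += 1
        picks.append(i)
    return picks

# Hypothetical frame: ten PSUs with areas in hectares, five to be selected
print(systematic_pps([90, 110, 100, 95, 105, 100, 98, 102, 100, 100], 5))
```

Larger units span more of the cumulated range, so they are more likely to contain one of the equally spaced selection points.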

At the second stage, the 780 sampled PSUs were divided into secondary sampling units (SSUs), also called segments. Each segment is estimated to be around 10 ha for strata 1.1 and 2.0 and 50 ha for stratum 3.0 (as shown in Figure 3). For each PSU, only one SSU is selected by random sampling without replacement. This is why, for the 2019 SAS Seasons A and B, the same number of 780 SSUs was selected. In addition, a list frame of large-scale farmers (LSF) with at least 10 hectares of agricultural holdings was compiled to complement the area frame, covering crops that are mostly grown by large-scale farmers and cannot easily be covered in the area frame.

At the last sampling stage, in strata 1.1 and 2.0 each segment of an average size of 10 ha (100,000 square meters) was divided into around 1,000 grid squares of 100 square meters each, while for stratum 3.0 each segment was divided into around 5,000 grid squares of 100 square meters each. A point was placed at the center of every grid square and named a grid point (a geographical location at the center of a grid square). A random sample of 5% of the total grid points was selected in each segment of strata 1.1 and 2.0, whereas a random sample of 2% of the total grid points was selected in each segment of stratum 3.0. Grid points are the reporting units within a segment: enumerators go to every grid point, locate and delineate the plots in which the grid point falls, and collect records of land use and related information. The recorded information represents the characteristics of the whole segment, which are extrapolated to the stratum level; the combination of strata within each district then provides district-level area statistics.
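The grid arithmetic above can be checked with a short sketch. The helper name `grid_points` and its parameters are hypothetical; the segment sizes and sampling fractions are the approximate figures quoted in the text.

```python
def grid_points(segment_ha, cell_m2=100, sample_fraction=0.05):
    """Return (total grid squares, sampled grid points) for one segment."""
    area_m2 = int(segment_ha * 10_000)   # 1 ha = 10,000 square meters
    total = area_m2 // cell_m2           # one grid point per grid square
    sampled = round(total * sample_fraction)
    return total, sampled

# Strata 1.1 and 2.0: segments of about 10 ha, 5% of grid points sampled
print(grid_points(10, sample_fraction=0.05))
# Stratum 3.0: segments of about 50 ha, 2% of grid points sampled
print(grid_points(50, sample_fraction=0.02))
```

Under these figures, a 10 ha segment yields about 50 sampled grid points and a 50 ha segment about 100.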

Mode of data collection

Face-to-face [f2f]

Research instrument

Two types of questionnaire were used for this survey: a screening questionnaire and a plot questionnaire. The screening questionnaire was used to collect information that enabled identification of a plot and its land use using the plot questionnaire. For point sampling, the plot questionnaire is concerned with the collection of data on crop identification, crop production and use of production, inputs (seeds, fertilizers and pesticides), agricultural practices and land tenure. All the survey questionnaires used were published in English.

Cleaning operations

The CAPI method of data collection allows enumerators in the field to collect and enter data on their tablets and then synchronize to the server at headquarters, where the data are received by NISR staff, checked for consistency, and then transmitted to analysts for tabulation using Stata software and reporting using Microsoft Excel and Word.

Response rate

Data collection was done in 780 segments and 222 large-scale farmers' holdings for Season A, whereas in Season C data were collected in 232 segments. The response rate was 100% of the sample.
Annexes to the scientific report on the cumulative dietary exposure...
zenodo.org
bin
Updated Jan 24, 2020
European Food Safety Authority; Bruno Dujardin; Valentina Bocca (2020). Annexes to the scientific report on the cumulative dietary exposure assessment of pesticides that have chronic effects on the thyroid using SAS® software - Input and output data sets [Dataset]. http://doi.org/10.5281/zenodo.3338152
Available download formats: bin
Unique identifier
https://doi.org/10.5281/zenodo.3338152
Dataset updated
Jan 24, 2020
Dataset provided by
Zenodo (http://zenodo.org/)
Authors
European Food Safety Authority; Bruno Dujardin; Valentina Bocca
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Retrospective dietary exposure assessments were conducted for two groups of pesticides that have chronic effects on the thyroid:

hypertrophy, hyperplasia and neoplasia of C-cells, i.e. affecting the parafollicular cells or the calcitonin system of the thyroid (CAG-TCP);

hypothyroidism, i.e. affecting the follicular cells and/or the hormone system of the thyroid (CAG-TCF).

The pesticides considered in this assessment were identified and characterised in the scientific report on the establishment of cumulative assessment groups of pesticides for their effects on the thyroid (here).

The exposure calculations used monitoring data collected by Member States under their official pesticide monitoring programmes in 2014, 2015 and 2016 and individual food consumption data from ten populations of consumers from different countries and from different age groups. Regarding the selection of relevant food commodities, the assessment included water, foods for infants and young children and 30 raw primary commodities of plant origin that are widely consumed within Europe.

Exposure estimates were obtained with SAS® software using a 2-dimensional probabilistic method, which is composed of an inner-loop execution and an outer-loop execution. Variability within the population is modelled through the inner-loop execution and is expressed as a percentile of the exposure distribution. The outer-loop execution is used to derive 95% confidence intervals around those percentiles (reflecting the sampling uncertainty of the input data).
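The inner-loop and outer-loop structure can be illustrated with a heavily simplified sketch. It is written in Python rather than SAS and is not the procedure used in the report: it assumes a single list of hypothetical exposure values instead of separate occurrence and consumption inputs, and all names (`two_dimensional_mc`, `percentile`, the parameters) are illustrative.

```python
import math
import random

def percentile(values, q):
    """Nearest-rank percentile of a list (q between 0 and 100)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]

def two_dimensional_mc(exposures, q=99.0, n_outer=200, n_inner=1000, seed=0):
    """Return (median, lower, upper) for the q-th exposure percentile,
    where lower and upper bound a 95% interval from the outer loop."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_outer):
        # Outer loop: bootstrap the inputs to reflect sampling uncertainty
        resample = rng.choices(exposures, k=len(exposures))
        # Inner loop: simulate variability within the population
        simulated = rng.choices(resample, k=n_inner)
        estimates.append(percentile(simulated, q))
    return (percentile(estimates, 50),
            percentile(estimates, 2.5),
            percentile(estimates, 97.5))

# Hypothetical exposure values, for illustration only
data = [0.1 * i for i in range(1, 101)]
print(two_dimensional_mc(data, q=95, n_outer=50, n_inner=200))
```

Each outer iteration produces one percentile estimate; the spread of those estimates across iterations gives the confidence interval.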

Furthermore, calculations were carried out according to a tiered approach. While the first-tier calculations (Tier I) use very conservative assumptions for an efficient screening of the exposure with low risk for underestimation, the second-tier assessment (Tier II) includes assumptions that are more refined but still conservative. For each scenario, exposure estimates were obtained for different percentiles of the exposure distribution and the total margin of exposure (MOET, i.e. the ratio of the toxicological reference dose to the estimated exposure) was calculated at each percentile.

The input and output data for the exposure assessment are reported in the following annexes:

Annex A.1 – Input data for the exposure assessment of CAG-TCP

Annex A.2 – Input data for the exposure assessment of CAG-TCF

Annex B.1 – Output data from the Tier I exposure assessment of CAG-TCP

Annex B.2 – Output data from the Tier I exposure assessment of CAG-TCF

Annex C.1 – Output data from the Tier II exposure assessment of CAG-TCP

Annex C.2 – Output data from the Tier II exposure assessment of CAG-TCF

Further information on the data, methodologies and interpretation of the results is provided in the scientific report on the cumulative dietary exposure assessment of pesticides that have chronic effects on the thyroid using SAS® software (here).

The results reported in this assessment only refer to the exposure and are not an estimation of the actual risks. These exposure estimates should therefore be considered as documentation for the final scientific report on the cumulative risk assessment of dietary exposure to pesticides for their effects on the thyroid (here). The latter combines the hazard assessment and exposure assessment into a consolidated risk characterisation, including all related uncertainties.
Bayesian inference of protein conformational ensembles from limited...
plos.figshare.com
tiff
Updated Jun 1, 2023
Wojciech Potrzebowski; Jill Trewhella; Ingemar Andre (2023). Bayesian inference of protein conformational ensembles from limited structural data [Dataset]. http://doi.org/10.1371/journal.pcbi.1006641
Available download formats: tiff
Unique identifier
https://doi.org/10.1371/journal.pcbi.1006641
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS Computational Biology
Authors
Wojciech Potrzebowski; Jill Trewhella; Ingemar Andre
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Many proteins consist of folded domains connected by regions with higher flexibility. The details of the resulting conformational ensemble play a central role in controlling interactions between domains and with binding partners. Small-Angle Scattering (SAS) is well-suited to study the conformational states adopted by proteins in solution. However, analysis is complicated by the limited information content in SAS data, and care must be taken to avoid constructing overly complex ensemble models and fitting to noise in the experimental data. To address these challenges, we developed a method based on Bayesian statistics that infers conformational ensembles from a structural library generated by all-atom Monte Carlo simulations. The first stage of the method involves a fast model selection based on variational Bayesian inference that maximizes the model evidence of the selected ensemble. This is followed by a complete Bayesian inference of population weights in the selected ensemble. Experiments with simulated ensembles demonstrate that model evidence is capable of identifying the correct ensemble and that the correct number of ensemble members can be recovered up to a high level of noise. Using experimental data, we demonstrate how the method can be extended to include data from Nuclear Magnetic Resonance (NMR) and structural energies of conformers extracted from the all-atom energy functions. We show that the data from SAXS, NMR chemical shifts and energies calculated from conformers can work synergistically to improve the definition of the conformational ensemble.
Summary statistics for participants in interviews at month-one and...
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated May 31, 2023
Vennie Mbaziira Nabitaka; Pamela Nawaggi; Jennifer Campbell; James Conroy; Joseph Harwell; Kinanga Magambo; Caroline Middlecote; Benvy Caldwell; Cordelia Katureebe; Norah Namuwenge; Rita Atugonza; Andrew Musoke; Joshua Musinguzi (2023). Summary statistics for participants in interviews at month-one and month-six. [Dataset]. http://doi.org/10.1371/journal.pone.0232419.t003
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0232419.t003
Dataset updated
May 31, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Vennie Mbaziira Nabitaka; Pamela Nawaggi; Jennifer Campbell; James Conroy; Joseph Harwell; Kinanga Magambo; Caroline Middlecote; Benvy Caldwell; Cordelia Katureebe; Norah Namuwenge; Rita Atugonza; Andrew Musoke; Joshua Musinguzi
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Summary statistics for participants in interviews at month-one and month-six.
Baseline summary statistics of all patients that were enrolled in the study....
plos.figshare.com
datasetcatalog.nlm.nih.gov
xls
Updated Jun 1, 2023
Vennie Mbaziira Nabitaka; Pamela Nawaggi; Jennifer Campbell; James Conroy; Joseph Harwell; Kinanga Magambo; Caroline Middlecote; Benvy Caldwell; Cordelia Katureebe; Norah Namuwenge; Rita Atugonza; Andrew Musoke; Joshua Musinguzi (2023). Baseline summary statistics of all patients that were enrolled in the study. [Dataset]. http://doi.org/10.1371/journal.pone.0232419.t001
Available download formats: xls
Unique identifier
https://doi.org/10.1371/journal.pone.0232419.t001
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS (http://plos.org/)
Authors
Vennie Mbaziira Nabitaka; Pamela Nawaggi; Jennifer Campbell; James Conroy; Joseph Harwell; Kinanga Magambo; Caroline Middlecote; Benvy Caldwell; Cordelia Katureebe; Norah Namuwenge; Rita Atugonza; Andrew Musoke; Joshua Musinguzi
License
Attribution 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
License information was derived automatically
Description
Baseline summary statistics of all patients that were enrolled in the study.
Supplement 1. Detailed description of how the methods are applied to data,...
wiley.figshare.com
html
Updated Jun 5, 2023
John Connolly; Marc W. Cadotte; Caroline Brophy; Áine Dooley; John Finn; Laura Kirwan; Christiane Roscher; Alexandra Weigelt (2023). Supplement 1. Detailed description of how the methods are applied to data, including SAS and R code and data from two experiments. [Dataset]. http://doi.org/10.6084/m9.figshare.3551508.v1
Available download formats: html
Unique identifier
https://doi.org/10.6084/m9.figshare.3551508.v1
Dataset updated
Jun 5, 2023
Dataset provided by
Wiley
Authors
John Connolly; Marc W. Cadotte; Caroline Brophy; Áine Dooley; John Finn; Laura Kirwan; Christiane Roscher; Alexandra Weigelt
License
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Description
File List

Jena_dataset.pdf: Worked example of model fitting for the Jena_dataset.pdf
Jena_dataset.sas: SAS code for analysis of Jena_dataset.sas
Jena_dataset.r: R code for analysis of Jena_dataset.r
Jena_data.csv: Jena data
Ireland_site_biodepth.csv: Data for Ireland_site_Biodepth.csv

Description

The supplements are designed to assist the reader in implementing the methods using the statistical packages SAS and R. The first supplement (Worked example of model fitting for the Jena_dataset.pdf) provides a detailed description of the application and interpretation of a range of models using the Jena dataset. The second and third supplements (SAS code for analysis of Jena_dataset.sas) and (R code for analysis of Jena_dataset.r) provide SAS and R code to implement the method using the Jena dataset. The data for the two sites are provided in Jena_data.csv and Ireland_site_biodepth.csv.

Hash values for supplements Jena_data.csv and Ireland_site_biodepth.csv, calculated by HASHCALC:
MD5 hash value for Jena_data.csv: 6b86c280a15bbd4aae08b5b4c91363ee
MD5 hash value for Ireland_site_biodepth.csv: 9b60c32ceca9259e47d7ee42b9ae5f16
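A downloaded copy of the two data files can be checked against the MD5 values listed in this description. The sketch below uses Python's standard `hashlib` module; the helper names `md5_of` and `verify` are hypothetical, and the file paths are assumed to point at local copies.

```python
import hashlib

# MD5 values as listed in the supplement description
EXPECTED_MD5 = {
    "Jena_data.csv": "6b86c280a15bbd4aae08b5b4c91363ee",
    "Ireland_site_biodepth.csv": "9b60c32ceca9259e47d7ee42b9ae5f16",
}

def md5_of(path, chunk_size=65536):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify(path, name):
    """Return True if the file at path matches the listed hash for name."""
    return md5_of(path) == EXPECTED_MD5[name]
```

For example, `verify("Jena_data.csv", "Jena_data.csv")` would return `True` only if the local file is byte-for-byte identical to the one that was hashed.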

U.S. EPA Office of Research and Development (ORD) (2020). SAS code used to analyze data and a datafile with metadata glossary [Dataset]. https://catalog.data.gov/dataset/sas-code-used-to-analyze-data-and-a-datafile-with-metadata-glossary

SAS code used to analyze data and a datafile with metadata glossary

Dataset updated

Nov 12, 2020

Dataset provided by

United States Environmental Protection Agency (http://www.epa.gov/)

Description

We compiled macroinvertebrate assemblage data collected from 1995 to 2014 from the St. Louis River Area of Concern (AOC) of western Lake Superior. Our objective was to define depth-adjusted cutoff values for benthos condition classes (poor, fair, reference) to provide a tool useful for assessing progress toward achieving removal targets for the degraded benthos beneficial use impairment in the AOC. The relationship between depth and benthos metrics was wedge-shaped. We therefore used quantile regression to model the limiting effect of depth on selected benthos metrics, including taxa richness, percent non-oligochaete individuals, combined percent Ephemeroptera, Trichoptera, and Odonata individuals, and density of ephemerid mayfly nymphs (Hexagenia). We created a scaled trimetric index from the first three metrics. Metric values at or above the 90th percentile quantile regression model prediction were defined as reference condition for that depth. We set the cutoff between poor and fair condition as the 50th percentile model prediction. We examined sampler type, exposure, geographic zone of the AOC, and substrate type for confounding effects. Based on these analyses we combined data across sampler type and exposure classes and created separate models for each geographic zone. We used the resulting condition class cutoff values to assess the relative benthic condition for three habitat restoration project areas. The depth-limited pattern of ephemerid abundance we observed in the St. Louis River AOC also occurred elsewhere in the Great Lakes. We provide tabulated model predictions for application of our depth-adjusted condition class cutoff values to new sample data. This dataset is associated with the following publication: Angradi, T., W. Bartsch, A. Trebitz, V. Brady, and J. Launspach. A depth-adjusted ambient distribution approach for setting numeric removal targets for a Great Lakes Area of Concern beneficial use impairment: Degraded benthos. JOURNAL OF GREAT LAKES RESEARCH. International Association for Great Lakes Research, Ann Arbor, MI, USA, 43(1): 108-120, (2017).
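The cutoff rule described above can be expressed as a small function. This is a sketch in Python, not the SAS code distributed with the dataset; the function and argument names are hypothetical, the two predictions are assumed to come from the tabulated 50th and 90th percentile model values for the sample's depth and zone, and treating a value exactly at the 50th percentile prediction as "fair" is an assumption, since the description does not say which side of that cutoff is inclusive.

```python
def condition_class(metric_value, q50_prediction, q90_prediction):
    """Assign a benthos condition class from depth-adjusted cutoffs."""
    if metric_value >= q90_prediction:
        return "reference"   # at or above the 90th percentile prediction
    if metric_value >= q50_prediction:
        return "fair"        # assumed inclusive at the 50th percentile cutoff
    return "poor"

# Hypothetical metric value and model predictions for one sample depth
print(condition_class(12.0, q50_prediction=8.0, q90_prediction=15.0))
```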
