Facebook
TwitterWe include a description of the data sets in the meta-data as well as sample code and results from a simulated data set. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: The R code is available on line here: https://github.com/warrenjl/SpGPCW. Format: Abstract The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publicly available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. File format: R workspace file. Metadata (including data dictionary) • y: Vector of binary responses (1: preterm birth, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate). This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).
Facebook
TwitterA minor naming issue, the dataset was originally just income data. While the dataset has been renamed, the link has not. This data is not just income data.
This dataset was uploaded to determine a possible relation between economic status of counties, and COVID-19 cases in Maryland by-county. While the data is from only 2018, 2020 census data is not available until March, 2021. This data is built to analyze features in Maryland that concern quality of living.
These datasets come from https://commerce.maryland.gov/about/rankings-and-statistics/data-explorer. They were generated by the US Census Bureau in 2018.
Disclaimer: No infringement is intended in any way, sources are US Census Bureau, Maryland Department of Labor Licensing and Regulation, Comptroller of Maryland, and Maryland Association of Realtors. Licensing statement: (https://commerce.maryland.gov/commerce/privacy)
DISCLAIMER OF LIABILITY & ENDORSEMENT
The content of the this site or any other site, owned, operated, licensed, sponsored or controlled by the Maryland Department of Commerce comes from a variety of public and private information sources. Although effort has been made to provide complete and accurate information, the information should always be verified before it is used in any way. Maryland Commerce makes no warranty, express or implied, nor assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information from the service or links to other services. Any reference obtained from this site to a specific company, commercial product, process, or service does not constitute or imply an endorsement, recommendation, or favoring by Maryland Commerce of the company, commercial product, process, or service.
NOTICE SPECIFIC TO DOCUMENTS AVAILABLE ON THIS WEBSITE
Permission to use documents (such as reports, publications, press releases, datasheets and FAQs) from this site is granted, provided that (1) the copyright notice appears in all copies and that both the copyright notice and this permission notice appear, (2) use of such documents from this site is for informational and noncommercial or personal use only and will not be copied or posted on any network computer or broadcast in any media, and (3) no modifications of any documents are made. Documents specified above do not include the design or layout of this website or any other site owned, operated, licensed, sponsored or controlled by Maryland Commerce. Elements of Maryland Commerce Web sites are protected by trade dress, trademark, unfair competition, and other laws and may not be copied or imitated in whole or in part. No logo, graphic, sound or image from any Maryland Commerce website may be copied, retransmitted, or used in a manner which suggests Maryland Commerce's association with or endorsement of any product, service, opinion or cause unless expressly permitted by Maryland Commerce. The documents and related graphics published on this site could include technical inaccuracies or typographical errors. Maryland Commerce may make changes in the site described herein at any time.
NOTICE REGARDING DOCUMENTS AND SERVICES AVAILABLE ON THIS WEBSITE
Users entering materials on this website or any other site owned, operated, licensed, sponsored or controlled by Maryland Commerce are responsible for the content of that material. Maryland Commerce has no responsibility for the content of any messages or information posted by users, or for the content of information of third parties on the Internet, even if accessed through Maryland Commerce pages. Maryland Commerce does retain the right, which it may or may not exercise, in its sole discretion, to review, edit or delete from the service any third-party material which Maryland Commerce deems to be inappropriate in any way. In no event shall Maryland Commerce be liable for any damages whatsoever resulting from loss of use, data or profits, whether in an action of contract, negligence or other tort action, arising out of or in connection with the use or performance of documents, provision or failure to provide services, or information available from this service.
LINKS TO THIRD PARTY SITES
Some links in this Web site or any other site owned, operated, licensed, sponsored or controlled by Maryland Commerce are not under the control of Maryland Commerce and Maryland Commerce is not responsible for the contents of any linked site or any link contained in a linked site, or any changes or updates to such sites. Maryland Commerce is not responsible for webcasting or any other form of transmission received from any linked site. Maryland Commerce is providing these links to you only as a convenience and the inclusion of any link does not imply endorsement by Maryland Commerce of the site.
Facebook
TwitterThese are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: File format: R workspace file; “Simulated_Dataset.RData”. Metadata (including data dictionary) • y: Vector of binary responses (1: adverse outcome, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate) Code Abstract We provide R statistical software code (“CWVS_LMC.txt”) to fit the linear model of coregionalization (LMC) version of the Critical Window Variable Selection (CWVS) method developed in the manuscript. We also provide R code (“Results_Summary.txt”) to summarize/plot the estimated critical windows and posterior marginal inclusion probabilities. Description “CWVS_LMC.txt”: This code is delivered to the user in the form of a .txt file that contains R statistical software code. Once the “Simulated_Dataset.RData” workspace has been loaded into R, the text in the file can be used to identify/estimate critical windows of susceptibility and posterior marginal inclusion probabilities. “Results_Summary.txt”: This code is also delivered to the user in the form of a .txt file that contains R statistical software code. Once the “CWVS_LMC.txt” code is applied to the simulated dataset and the program has completed, this code can be used to summarize and plot the identified/estimated critical windows and posterior marginal inclusion probabilities (similar to the plots shown in the manuscript). Optional Information (complete as necessary) Required R packages: • For running “CWVS_LMC.txt”: • msm: Sampling from the truncated normal distribution • mnormt: Sampling from the multivariate normal distribution • BayesLogit: Sampling from the Polya-Gamma distribution • For running “Results_Summary.txt”: • plotrix: Plotting the posterior means and credible intervals Instructions for Use Reproducibility (Mandatory) What can be reproduced: The data and code can be used to identify/estimate critical windows from one of the actual simulated datasets generated under setting E4 from the presented simulation study. How to use the information: • Load the “Simulated_Dataset.RData” workspace • Run the code contained in “CWVS_LMC.txt” • Once the “CWVS_LMC.txt” code is complete, run “Results_Summary.txt”. Format: Below is the replication procedure for the attached data set for the portion of the analyses using a simulated data set: Data The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publically available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics, and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).
Facebook
TwitterFirst, a caveat: the NFIP data does NOT provide information specific to individual homes or parcels. This information is protected under federal law. All personal identifying information about policy holders has been redacted, and data has been anonymized to census tract, reported ZIP code, and one decimal point digit of latitute and longitude. If mapped, flood insurance policies and claims may appear to be clustered at a particular location due to this anonymization. What all that means: you cannot search for an address to see whether it has flooded. However, among many things, this data shows flooding trends in Norfolk over the last 40+ years. It shows the census tracts that flood most frequently. And it shows where the largest number and highest value of claims occur.
FEMA believes this historic release of NFIP data promotes transparency, reduces complexity related to public data requests, and improves how stakeholders interact with and understand the program. This is the largest, most comprehensive release of NFIP data coordinated by FEMA to date. This dataset allows for customizable searches to create reports, analyze and visualize present and historical NFIP data faster and easier than before. This data will help FEMA build a national culture of preparedness by providing claims and policy information people need to make better choices about their flood risk and the insurance they need to protect the life they've built. Norfolk's Open Data team extracted city-specific information from the FEMA dataset. The dataset included here represents almost 6,000 claims on record from 1977 through 2019, totaling 67 million dollars in damage in the City of Norfolk.
To view the most updated version of the dataset, please click here: https://data.norfolk.gov/Government/FEMA-NFIP-Claims/suf7-r643/about_data
Facebook
TwitterThe Quarterly Census of Employment and Wages (QCEW) Program is a Federal-State cooperative program between the U.S. Department of Labor’s Bureau of Labor Statistics (BLS) and the California EDD’s Labor Market Information Division (LMID). The QCEW program produces a comprehensive tabulation of employment and wage information for workers covered by California Unemployment Insurance (UI) laws and Federal workers covered by the Unemployment Compensation for Federal Employees (UCFE) program. The QCEW program serves as a near census of monthly employment and quarterly wage information by 6-digit industry codes from the North American Industry Classification System (NAICS) at the national, state, and county levels. At the national level, the QCEW program publishes employment and wage data for nearly every NAICS industry. At the state and local area level, the QCEW program publishes employment and wage data down to the 6-digit NAICS industry level, if disclosure restrictions are met. In accordance with the BLS policy, data provided to the Bureau in confidence are used only for specified statistical purposes and are not published. The BLS withholds publication of Unemployment Insurance law-covered employment and wage data for any industry level when necessary to protect the identity of cooperating employers. Data from the QCEW program serve as an important input to many BLS programs. The Current Employment Statistics and the Occupational Employment Statistics programs use the QCEW data as the benchmark source for employment. The UI administrative records collected under the QCEW program serve as a sampling frame for the BLS establishment surveys. In addition, the data serve as an input to other federal and state programs. The Bureau of Economic Analysis (BEA) of the Department of Commerce uses the QCEW data as the base for developing the wage and salary component of personal income. The U.S. Department of Labor’s Employment and Training Administration (ETA) and California's EDD use the QCEW data to administer the Unemployment Insurance program. The QCEW data accurately reflect the extent of coverage of California’s UI laws and are used to measure UI revenues; national, state and local area employment; and total and UI taxable wage trends. The U.S. Department of Labor’s Bureau of Labor Statistics publishes new QCEW data in its County Employment and Wages news release on a quarterly basis. The BLS also publishes a subset of its quarterly data through the Create Customized Tables system, and full quarterly industry detail data at all geographic levels. Disclaimer: For information regarding future updates or preliminary/final data releases, please refer to the Bureau of Labor Statistics Release Calendar: https://www.bls.gov/cew/release-calendar.htm
Facebook
TwitterFirst, a caveat: the NFIP data does NOT provide information specific to individual homes or parcels. This information is protected under federal law. All personal identifying information about policy holders has been redacted, and data has been anonymized to census tract, reported ZIP code, and one decimal point digit of latitute and longitude. If mapped, flood insurance policies and claims may appear to be clustered at a particular location due to this anonymization. What all that means: you cannot search for an address to see whether it has flooded. However, among many things, this data shows flooding trends in Norfolk over the last 40+ years. It shows the census tracts that flood most frequently. And it shows where the largest number and highest value of claims occur.
FEMA believes this historic release of NFIP data promotes transparency, reduces complexity related to public data requests, and improves how stakeholders interact with and understand the program. This is the largest, most comprehensive release of NFIP data coordinated by FEMA to date. This dataset allows for customizable searches to create reports, analyze and visualize present and historical NFIP data faster and easier than before. This data will help FEMA build a national culture of preparedness by providing claims and policy information people need to make better choices about their flood risk and the insurance they need to protect the life they've built. Norfolk's Open Data team extracted city-specific information from the FEMA dataset. The dataset included here represents almost 6,000 claims on record from 1977 through 2019, totaling 67 million dollars in damage in the City of Norfolk.
Facebook
TwitterThis data set includes all verified Intimate Partner Violence occurrences investigated by Toronto Police Service by reported date since 2014. The Intimate Partner Violence categories include Family, Intimate Partner, and Unclassified. Count is an aggregated count of Intimate Partner Violence by fields listed in the dataset.Intimate Partner Violence DashboardDownload DocumentationDefinitions:FamilyFamilial relationships such as parents, siblings, or any other family members.Intimate PartnerThe TPS IPV Procedure defines an Intimate Relationship as "marriage, domestic partnership, engagement, casual or serious romantic involvement, and dating, whether in a current or former relationship. Intimate Partner Violence can occur between persons of any sex, sexual orientation, gender, gender identity, or gender expression, and it can occur in any type of intimate relationship including monogamous, non- committed, and relationships involving more than two partners.Intimate Relationship – DatingIntimate Relationship - Dating means marriage, domestic partnership, engagement, casual or serious romantic involvement, and dating, whether in a current or former relationship. Intimate Partner Violence can occur between persons of any sex, sexual orientation, gender, gender identity, or gender expression, and it can occur in any type of intimate relationship including monogamous, non-committed, and relationships involving more than two partners.Intimate Partner ViolenceIntimate Partner Violence means any physical, sexual or psychological harm caused, or attempted, between persons involved in an intimate relationship including: • assault;• murder;• sexual assault;• threatening;• harassment;• intimidation;• unlawful interference with personal liberty;• any other criminal offence;• offences under other statutes, such as the Family Law Act, Children’s Law Reform Act, etc.;but does not include child abuse investigations.UnclassifiedThe relationship between the victim and the accused in the occurrence is not identified or indicated but the investigating officers have identified this as an intimate partner related incident. This data only includes confirmed Intimate Partner Violence crimes. This data does not include occurrences that have been deemed unfounded. The definition of unfounded according to Statistics Canada is: “It has been determined through police investigation that the offence reported did not occur, nor was it attempted” (Statistics Canada, 2020).**The dataset is intended to provide communities with information regarding public safety and awareness. The location of Intimate Partner Violence occurrences have been grouped to the fields in the dataset to protect the privacy of the parties involved in the occurrence. By accessing these datasets, the user agrees to full acknowledgement of the Open Government Licence - Ontario.In accordance with the Municipal Freedom of Information and Protection of Privacy Act, the Toronto Police Service has taken the necessary measures to protect the privacy of individuals involved in the reported occurrences. No personal information related to any of the parties involved in the occurrence will be released as open data.** Statistics Canada. 2020. Uniform Crime Reporting Manual. Surveys and Statistical Programs. Canadian Centre for Justice Statistics.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterWe include a description of the data sets in the meta-data as well as sample code and results from a simulated data set. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: The R code is available on line here: https://github.com/warrenjl/SpGPCW. Format: Abstract The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women. Availability Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publicly available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics and requires an appropriate data use agreement. Description Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures on each week by subtracting off the median exposure amount on a given week and dividing by the interquartile range (IQR) (as in the actual application to the true NC birth records data). The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way while the medians and IQRs are not given. This further protects identifiability of the spatial locations used in the analysis. File format: R workspace file. Metadata (including data dictionary) • y: Vector of binary responses (1: preterm birth, 0: control) • x: Matrix of covariates; one row for each simulated individual • z: Matrix of standardized pollution exposures • n: Number of simulated individuals • m: Number of exposure time periods (e.g., weeks of pregnancy) • p: Number of columns in the covariate design matrix • alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate). This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).