Note: This filtered view shows only those service requests from the underlying dataset that are not marked as duplicates. -- All open graffiti removal requests made to 311 and all requests completed since January 1, 2011. The Department of Streets & Sanitation's Graffiti Blasters crews offer a vandalism removal service to private property owners. Graffiti Blasters employ "blast" trucks that use baking soda under high water pressure to erase painted graffiti from brick, stone and other mineral surfaces. They also use paint trucks to cover graffiti on the remaining surfaces. Organizations and residents may report graffiti and request its removal. 311 sometimes receives duplicate requests for graffiti removal. Requests that have been labeled as Duplicates are in the same geographic area and have been entered into 311’s Customer Service Requests (CSR) system at around the same time as a previous request. Duplicate reports/requests are labeled as such in the Status field, as either "Open - Dup" or "Completed - Dup." Data is updated daily.
The U. S. Geological Survey (USGS) makes long-term seismic hazard forecasts that are used in building codes. The hazard models usually consider only natural seismicity; non-tectonic (man-made) earthquakes are excluded because they are transitory or too small. In the past decade, however, thousands of earthquakes related to underground fluid injection have occurred in the central and eastern U.S. (CEUS), and some have caused damage. In response, the USGS is now also making short-term forecasts that account for the hazard from these induced earthquakes. A uniform earthquake catalog is assembled by combining and winnowing pre-existing source catalogs. Seismicity statistics are analyzed to develop recurrence models, accounting for catalog completeness. In the USGS hazard modeling methodology, earthquakes are counted on a map grid, recurrence models are applied to estimate the rates of future earthquakes in each grid cell, and these rates are combined with maximum-magnitude models and ground-motion models to compute the hazard. The USGS published a forecast for the years 2016 and 2017. This data set is the catalog of natural and induced earthquakes without duplicates. Duplicate events have been removed based on a hierarchy of the source catalogs. Explosions and mining related events have been deleted.
National Center for Earth-surface Dynamics Data Repository
This dataset represents completed chemical land treatments on BLM managed lands in the states of Oregon and Washington. Chemical treatments are applications of herbicide or pesticide, to control or kill pests and invasive plants, or fertilizer to enhance plant growth sourced from the BLM HUB..EDITS: The following edits were applied to the dataset in order to reduce polygon clutter, remove duplicate records, and generally make the dataset more useful for its intended purpose of overlaying treatment perimeters with other layers for landscape scale, cross-jurisdictional planning:Delete Identical tool using shape as the input parameter Giant polygons incommensurate with treated acres were manually removedEntries with treated acres <1 acre were removedEntries with GIS acres <10 acres were removedA Polsby-Popper test was used to remove polygons that appeared as perfect circles - the multitude of circle polygons representing vague treatment locations often cluttered the map and could be used for meaningful analysisA ratio of the area of polygons resulting from the Minimum Bounding Geometry tool compared to actual GIS acres of each polygon was used to remove perfect squares (values closer to 1 were perfect squares or rectangles) for a similar reason as the perfect circles
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/.
https://opensource.org/licenses/BSD-3-Clausehttps://opensource.org/licenses/BSD-3-Clause
onion (ONe Instance ONly) is a tool for removing duplicate parts from large collections of texts. The tool has been implemented in Python, licensed under New BSD License and made an open source software (available for download including the source code at http://code.google.com/p/onion/). It is being successfuly used for cleaning large textual corpora at Natural language processing centre at Faculty of informatics, Masaryk university Brno and it's industry partners. The research leading to this piece of software was published in author's Ph.D. thesis "Removing Boilerplate and Duplicate Content from Web Corpora". The deduplication algorithm is based on comparing n-grams of words of text. The author's algorithm has been shown to be more suitable for textual corpora deduplication than competing algorithms (Broder, Charikar): in addition to detection of identical or very similar (95 %) duplicates, it is able to detect even partially similar duplicates (50 %) still achieving great performace (further described in author's Ph.D. thesis). The unique deduplication capabilities and scalability of the algorithm were been demonstrated while building corpora of American Spanish, Arabic, Czech, French, Japanese, Russian, Tajik, and six Turkic languages consisting --- several TB of text documents were deduplicated resulting in corpora of 70 billions tokens altogether.
Chromatin immunoprecipitation and sequencing (ChIP-seq) has been widely used to map DNA-binding proteins, histone proteins and their modifications. ChIP-seq data contains redundant reads termed duplicates, referring to those mapping to the same genomic location and strand. There are two main sources of duplicates: polymerase chain reaction (PCR) duplicates and natural duplicates. Unlike natural duplicates that represent true signals from sequencing of independent DNA templates, PCR duplicates are artifacts originating from sequencing of identical copies amplified from the same DNA template. In analysis, duplicates are removed from peak calling and signal quantification. Nevertheless, a significant portion of the duplicates is believed to represent true signals. Obviously, removing all duplicates will underestimate the signal level in peaks and impact the identification of signal changes across samples. Therefore, an in-depth evaluation of the impact from duplicate removal is needed. Using eight public ChIP-seq datasets from three narrow-peak and two broad-peak marks, we tried to understand the distribution of duplicates in the genome, the extent by which duplicate removal impacts peak calling and signal estimation, and the factors associated with duplicate level in peaks. The three PCR-free histone H3 lysine 4 trimethylation (H3K4me3) ChIP-seq data had about 40% duplicates and 97% of them were within peaks. For the other datasets generated with PCR amplification of ChIP DNA, as expected, the narrow-peak marks have a much higher proportion of duplicates than the broad-peak marks. We found that duplicates are enriched in peaks and largely represent true signals, more conspicuous in those with high confidence. Furthermore, duplicate level in peaks is strongly correlated with the target enrichment level estimated using nonredundant reads, which provides the basis to properly allocate duplicates between noise and signal. Our analysis supports the feasibility of retaining the portion of signal duplicates into downstream analysis, thus alleviating the limitation of complete deduplication.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
🇮🇪 아일랜드
Note: This filtered view shows only those service requests from the underlying dataset that are not marked as duplicates. -- All open tree debris removal requests made to 311 and all requests completed since January 1, 2011. Large piles of branches or bushes may be picked up by the Department of Streets and Sanitation. 311 sometimes creates duplicate requests for tree debris removal. When there is an open tree debris request, a duplicate request is created when the exact same address and the exact same service request type are used. Streets and Sanitation responds to the initial request opened and closes the duplicates. A forestry "Clam" is the name of the vehicle the Forestry Bureau deploys to collect tree debris. Data Owner: Streets and Sanitation (http://www.cityofchicago.org/content/city/en/depts/streets.html). Time Period: January 1, 2011 to present. Frequency: Data is updated daily. Related Applications: 311 Service Request Status Inquiry (https://servicerequest.cityofchicago.org/web_intake_chic/Controller?op=createsrquery2) and Request Tree Debris Removal (https://servicerequest.cityofchicago.org/web_intake_chic/Controller?op=locform&invSRType=SEL&invSRDesc=Tree%20Debris&locreq=Y).
pvduy/argilla-dpo-mix-7k-gpt4o-refined-remove-same dataset hosted on Hugging Face and contributed by the HF Datasets community
https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy
The market for duplicate contact remover apps is experiencing robust growth, driven by the increasing use of smartphones and multiple social media accounts, leading to a proliferation of duplicate contacts across various devices. The market's expansion is fueled by the rising need for efficient contact management, particularly among professionals and individuals managing large contact lists. Businesses are increasingly adopting these apps to streamline their operations and improve data quality, leading to higher productivity and reduced administrative burdens. User demand for seamless data synchronization across platforms and enhanced privacy features further contributes to market expansion. While the exact market size for 2025 is unavailable, a reasonable estimation based on typical growth rates in similar software markets would place it within the range of $150-$200 million. Considering a conservative Compound Annual Growth Rate (CAGR) of 15% for the forecast period (2025-2033), we project substantial growth, reaching a potential market value of $600-$800 million by 2033. This growth trajectory is expected despite potential restraints like the availability of built-in contact management features in operating systems and the apprehension of users regarding data privacy and security related to third-party apps. The competitive landscape is relatively fragmented, with several key players vying for market share. Companies like ActivePrime, Compelson Labs, Systweak Software, and others offer a range of features, from basic duplicate detection to advanced functionalities like merging and deduplication across multiple accounts. Future growth will depend on the ability of these companies to innovate and offer unique value propositions, focusing on features like AI-powered contact organization, improved user interfaces, and enhanced integration with other productivity apps. Geographical expansion, particularly into emerging markets with a growing smartphone user base, will be a crucial factor in driving future revenue. The segment most likely to experience the strongest growth will be the enterprise segment, given the need for improved data management in large organizations. Marketing efforts focusing on the benefits of improved contact management, data accuracy, and time savings are key for success in this market.
During the fourth quarter of 2024, approximately 82 percent of removed video comments on the Google platform YouTube were deleted due to being spam, misleading, or scam content. Additionally, in the same quarter, around six percent of comments on videos were removed due to child safety reasons. Over 842.8 million comments were deleted from YouTube during the examined period.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Frequency of deleted and duplicated alleles in sperm from three control donors.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
BackgroundSurvivors of intimate partner violence (IPV) often face increased incidents of violence during stressful life events (SLEs) such as economic recessions, environmental disasters, and pandemics. These events can diminish the effectiveness of both formal (e.g., health, social, justice, labor, community) and informal (e.g., friends, family, neighbors) support systems. Additionally, SLEs exacerbate existing health and social inequities, making it necessary to understand the accessibility of support services during these times. This scoping review investigates access to services by individuals experiencing IPV during SLEs in high-income countries.ApproachA comprehensive search was conducted across several electronic databases including MEDLINE (OVID), Embase (OVID), PsychInfo (OVID), CINAHL (EBSCO), Global Health (EBSCO), Gender Watch (ProQuest), Web of Science, and Applied Social Sciences Index & Abstracts (ProQuest), along with the search engine Google Scholar. This search, which imposed no date restrictions, was extended through May 22nd, 2024. Key search terms were developed from prior literature and in consultation with an expert librarian, focusing on ‘stressful life events,’ ‘intimate partner violence,’ and ‘access to services.’. Each study was screened and extracted by two reviewers and conflicts were resolved through discussion or a third reviewer.ResultsThe search across eight databases and citation searching resulted in a total of 7396 potentially relevant articles. After removing 1968 duplicates and screening 5428 based on titles and abstracts, 200 articles underwent full abstract review. Ultimately, 74 articles satisfied the inclusion criteria and were selected for further analysis. The analysis focused on barriers and facilitators to access, identifying challenges within Survivors’ support systems, redirected resources during crises, and complex control dynamics and marginalization. Over 90% of the literature included covered the recent COVID-19 pandemic. Addressing these challenges requires innovative strategies, sustained funding, and targeted interventions for high-risk subgroups.ConclusionThis scoping review systematically outlined the challenges and enabling factors influencing the availability of support services for Survivors of IPV during SLEs. It underscores the need for robust, culturally sensitive health and social support mechanisms, and policies. Such measures are essential to better protect and assist IPV Survivors and their service providers during these critical times. Furthermore, it is imperative to integrate the insights and expertise of the violence against women (VAW) sector into emergency planning and policy-making to ensure comprehensive and effective responses that address the unique needs of Survivors in crises.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Supplement 3: Screening outcomes and reasons for inclusion/exclusion of articles in the scoping review (n = 5428 articles).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Barriers and Facilitators to accessing IPV supports during SLEs.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Note: This filtered view shows only those service requests from the underlying dataset that are not marked as duplicates. -- All open graffiti removal requests made to 311 and all requests completed since January 1, 2011. The Department of Streets & Sanitation's Graffiti Blasters crews offer a vandalism removal service to private property owners. Graffiti Blasters employ "blast" trucks that use baking soda under high water pressure to erase painted graffiti from brick, stone and other mineral surfaces. They also use paint trucks to cover graffiti on the remaining surfaces. Organizations and residents may report graffiti and request its removal. 311 sometimes receives duplicate requests for graffiti removal. Requests that have been labeled as Duplicates are in the same geographic area and have been entered into 311’s Customer Service Requests (CSR) system at around the same time as a previous request. Duplicate reports/requests are labeled as such in the Status field, as either "Open - Dup" or "Completed - Dup." Data is updated daily.