https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Russian language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Japanese language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
For closed-shell molecules, valence electron binding energies may be calculated accurately and efficiently with ab initio electron-propagator methods that have surpassed their predecessors. Advantageous combinations of accuracy and efficiency range from cubically scaling methods with mean errors of 0.2 eV to quintically scaling methods with mean errors of 0.1 eV or less. The diagonal self-energy approximation in the canonical Hartree–Fock basis is responsible for the enhanced efficiency of several methods. This work examines the predictive capabilities of diagonal self-energy approximations when they are generalized to the canonical spin–orbital basis of unrestricted Hartree–Fock (UHF) theory. Experimental data on atomic electron binding energies and high-level, correlated calculations in a fixed basis for a set of open-shell molecules constitute standards of comparison. A review of the underlying theory and analysis of numerical errors lead to several recommendations for the calculation of electron binding energies: (1) In calculations that employ the diagonal self-energy approximation, Koopmans’s identity for UHF must be qualitatively correct. (2) Closed-shell reference states are preferable to open-shell reference states in calculations of molecular ionization energies and electron affinities. (3) For molecular electron binding energies between doublets and triplets, calculations of electron detachment energies are more accurate and efficient than calculations of electron attachment energies. When these recommendations are followed, mean absolute errors increase by approximately 0.05 eV with respect to their counterparts obtained with closed-shell reference orbitals.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Tandem mass spectrometry is the prevailing approach for large-scale peptide sequencing in high-throughput proteomic profiling studies. Effective database search engines have been developed to identify peptide sequences from MS/MS fragmentation spectra. Since proteins are polymorphic and subject to post-translational modifications (PTM), however, computational methods for detecting unanticipated variants are also needed to achieve true proteome-wide coverage. Different from existing “unrestrictive” search tools, we present a novel algorithm, termed SIMS (for Sequential Motif Interval Search), that interprets pairs of product ion peaks, representing potential amino acid residues or “intervals”, as a means of mapping PTMs or substitutions in a blind database search mode. An effective heuristic software program was likewise developed to evaluate, rank, and filter optimal combinations of relevant intervals to identify candidate sequences, and any associated PTM or polymorphism, from large collections of MS/MS spectra. The prediction performance of SIMS was benchmarked extensively against annotated reference spectral data sets and compared favorably with, and was complementary to, current state-of-the-art methods. An exhaustive discovery screen using SIMS also revealed thousands of previously overlooked putative PTMs in a compendium of yeast protein complexes and in a proteome-wide map of adult mouse cardiomyocytes. We demonstrate that SIMS, freely accessible for academic research use, addresses gaps in current proteomic data interpretation pipelines, improving overall detection coverage, and facilitating comprehensive investigations of the fundamental multiplicity of the expressed proteome.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Unrestricted coupled cluster spin contamination corrected [UCCSD(T)] and unrestricted Brueckner doubles [UBD(T)] variations of the Weizmann-1 theory (W1), denoted as W1U, W1Usc, and W1BD, respectively, are compared with the restricted open-shell W1 theory [W1(RO)]. The performances of the four W1 variants are assessed with 220 total atomization energies, electron affinities, ionization potentials, and proton affinities in the G2/97 test set, for consistency with the error analysis of the original W1(RO) study. The root-mean-square deviations from the experiment of W1U (0.65 ± 0.48 kcal/mol), W1Usc (0.57 ± 0.48 kcal/mol), W1BD (0.62 ± 0.48 kcal/mol), and W1(RO) (0.57 ± 0.48 kcal/mol) show that the four methods are virtually indistinguishable. This error analysis excludes the “singlet biradicals,” C2 and O3, since single determinantal methods are not really adequate for these strongly multireference systems. The unrestricted W1 variants perform poorly for such highly spin-contaminated and multireference species (the largest deviation from experiment for W1Usc is −4.2 ± 0.1 kcal/mol for the O3 EA). W1(RO) performs much better than its unrestricted counterparts for these pathological cases (the deviation from experiment is reduced to −1.5 ± 0.1 kcal/mol for the O3 EA), though the errors are significantly larger than those for the overall test set. The examples of C2, O3, and the F2 potential energy curve indicate that an advantage to using W1BD is that the error in ⟨S2⟩ correlates with the magnitude of the error in energy, whereas W1(RO) loses accuracy without such a warning.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Hinglish language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
GPQA stands for Graduate-Level Google-Proof Q&A Benchmark. It's a challenging dataset designed to evaluate the capabilities of Large Language Models (LLMs) and scalable oversight mechanisms. Let me provide more details about it:
Description: GPQA consists of 448 multiple-choice questions meticulously crafted by domain experts in biology, physics, and chemistry. These questions are intentionally designed to be high-quality and extremely difficult. Expert Accuracy: Even experts who hold or are pursuing PhDs in the corresponding domains achieve only 65% accuracy on these questions (or 74% when excluding clear mistakes identified in retrospect). Google-Proof: The questions are "Google-proof," meaning that even with unrestricted access to the web, highly skilled non-expert validators only reach an accuracy of 34% despite spending over 30 minutes searching for answers. AI Systems Difficulty: State-of-the-art AI systems, including our strongest GPT-4 based baseline, achieve only 39% accuracy on this challenging dataset.
The difficulty of GPQA for both skilled non-experts and cutting-edge AI systems makes it an excellent resource for conducting realistic scalable oversight experiments. These experiments aim to explore ways for human experts to reliably obtain truthful information from AI systems that surpass human capabilities¹³.
In summary, GPQA serves as a valuable benchmark for assessing the robustness and limitations of language models, especially when faced with complex and nuanced questions. Its difficulty level encourages research into effective oversight methods, bridging the gap between AI and human expertise.
(1) [2311.12022] GPQA: A Graduate-Level Google-Proof Q&A Benchmark - arXiv.org. https://arxiv.org/abs/2311.12022. (2) GPQA: A Graduate-Level Google-Proof Q&A Benchmark — Klu. https://klu.ai/glossary/gpqa-eval. (3) GPA Dataset (Spring 2010 through Spring 2020) - Data Science Discovery. https://discovery.cs.illinois.edu/dataset/gpa/. (4) GPQA: A Graduate-Level Google-Proof Q&A Benchmark - GitHub. https://github.com/idavidrein/gpqa. (5) Data Sets - OpenIntro. https://www.openintro.org/data/index.php?data=satgpa. (6) undefined. https://doi.org/10.48550/arXiv.2311.12022. (7) undefined. https://arxiv.org/abs/2311.12022%29.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
EMODnet Chemistry aims to provide access to marine chemistry data sets and derived data products concerning eutrophication, ocean acidification and contaminants. The chemicals chosen reflect importance to the Marine Strategy Framework Directive (MSFD). This regional aggregated dataset contains all unrestricted EMODnet Chemistry data on contaminants (24 parameters), and covers the Mediterranean Sea with 4517 CDI records divided per matrices: 520 biota profiles, 560 water profiles, 3437 sediment (26 Vertical profiles and 3411 Time series). In the water dataset, the vertical profiles temporal range is from 1974-09-12 to 2015-09-14. In sediment dataset, vertical profiles temporal range is from 2010-08-02 to 2014-09-28 and time series temporal range is from 1981-06-27 to 2018-08-02. In biota time series temporal range is from 1979-03-29 to 2017-03-15. Data were aggregated and quality controlled by ‘Hellenic Centre for Marine Research, Hellenic National Oceanographic Data Centre (HCMR/HNODC)’ from Greece. Regional datasets concerning contaminants are automatically harvested. Parameter names in these datasets are based on P01, BODC Parameter Usage Vocabulary, which is available at: http://seadatanet.maris2.nl/bandit/browse_step.php . Each measurement value has a quality flag indicator. The resulting data collections for each Sea Basin are harmonised, and the collections are quality controlled by EMODnet Chemistry Regional Leaders using ODV Software and following a common methodology for all Sea Regions. Harmonisation means that: (1) unit conversion is carried out to express contaminant concentrations with a limited set of measurement units (according to EU directives 2013/39/UE; Comm. Dec. EU 2017/848) and (2) merging of variables described by different “local names” ,but corresponding exactly to the same concepts in BODC P01 vocabulary. The harmonised dataset can be downloaded as ODV spreadsheet (TXT file), which is composed of metadata header followed by tab separated values. This worksheet can be imported to ODV Software for visualisation (More information can be found at: https://www.seadatanet.org/Software/ODV ). The same dataset is offered also as XLSX file in a long/vertical format, in which each P01 measurement is a record line. Additionally, there are a series of columns that split P01 terms in subcomponents (measure, substance, CAS number, matrix...).This transposed format is more adapted to worksheet applications users (e.g. LibreOffice Calc). The 24 parameter names in this metadata record are based on P02, SeaDataNet Parameter Discovery Vocabulary, which is available at: http://seadatanet.maris2.nl/v_bodc_vocab_v2/vocab_relations.asp?lib=P02 . Detailed documentation will be published soon. The original datasets can be searched and downloaded from EMODnet Chemistry Download Service: https://emodnet-chemistry.maris.nl/search
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Polish language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Czech language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
EMODnet Chemistry aims to provide access to marine chemistry data sets and derived data products concerning eutrophication, ocean acidification and contaminants. The chemicals chosen reflect importance to the Marine Strategy Framework Directive (MSFD). This regional aggregated dataset contains all unrestricted EMODnet Chemistry data on contaminants; temperature, salinity and additional sampling parameters are included when available. The spatial coverage is the Black Sea with 20693 CDI records divided per matrices: 18834 water profiles, 1852 sediment profiles and 7 biota profiles. Vertical profiles temporal range is from 1974-08-24 to 2019-12-08 for water, from 1990-07-30 to 2017-10-02 for sediment and from 2008-05-15 to 2008-08-15. Data were harmonised and quality controlled by ‘National Institute for Marine Research and Development "Grigore Antipa"’ from Romania.
Regional datasets concerning contaminants are automatically harvested. Parameter names in these datasets are based on P01, BODC Parameter Usage Vocabulary, which is available at: https://vocab.seadatanet.org/p01-facet-search. Each measurement value has a quality flag indicator. The resulting data collections for each Sea Basin are harmonised, and the collections are quality controlled by EMODnet Chemistry Regional Leaders using ODV Software and following a common methodology for all Sea Regions.
Harmonisation means that: (1) unit conversion is carried out to express contaminant concentrations with a limited set of measurement units (according to EU directives 2013/39/UE; Comm. Dec. EU 2017/848) and (2) merging of variables described by different “local names”, but corresponding exactly to the same concepts in BODC P01 vocabulary.
Detailed documentation is available at: https://doi.org/10.6092/8b52e8d7-dc92-4305-9337-7634a5cae3f4
Explore and extract data at: https://emodnet-chemistry.webodv.awi.de/contaminants%3EBlackSea
The harmonised dataset can also be downloaded as ODV spreadsheet (TXT file), which is composed of metadata header followed by tab separated values. This worksheet can be imported to ODV Software for visualisation (More information can be found at: https://www.seadatanet.org/Software/ODV ).
The same dataset is offered also as TXT file in a long/vertical format, in which each P01 measurement is a record line. Additionally, there are a series of columns that split P01 terms in subcomponents (measure, substance, CAS number, matrix...).This transposed format is more adapted to worksheet applications users (e.g. LibreOffice Calc).
The original datasets can be searched and downloaded from EMODnet Chemistry Chemistry CDI Data and Discovery Access Service: https://emodnet-chemistry.maris.nl/search
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Chinese language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Italian language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Hungarian language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Spanish language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of other language ( whose names not mentioned ) movies with all certification categories. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories for Languages with Fewer Movies. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of Musical movies with all certification categories. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Silent language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Hindustani language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.
https://dataful.in/terms-and-conditionshttps://dataful.in/terms-and-conditions
This Dataset contains year-wise list of movies with all certification categories in Russian language. It has other details like certification date, movie length, certificate registration office, producer name etc. Note: 1) A-Certification means movies restricted to adult audiences 2) UA-certification means Unrestricted public exhibition subject to parental guidance for children below the age of twelve 3) U-Certificate means Unrestricted for Public Exhibition 4) S-Certification means movies restricted to specialized audiences such as doctors or scientists 5) The movie_length column is not properly defined at the source. The value mentioned in the movie_length column can either mean meters (for celluloid version) or minutes (for video version). For both meters and minutes, the unit is given as Mts at the source. 6) The data include not just feature films, but also short films, promos, songs, etc.