Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This dataset is the result of my study on web-scraping of English Wikipedia in R and my tests on regression and classification modelization in R.
The content is create by reading the appropriate articles in English Wikipedia about Italian cities: I did'nt run NPL analisys but only the table with the data and I ranked every city from 0 to N in every aspect. About the values, 0 means "*the city is not ranked in this aspect*" and N means "*the city is at first place, in descending order of importance, in this aspect* ". If there's no ranking in a particular aspect (for example, the only existence of the airports/harbours with no additional data about the traffic or the size), then 0 means "*no existence*" and N means "*there are N airports/harbours*". The only not-numeric column is the column with the name of the cities in English form, except some exceptions (for example, "*Bra (CN)* " because of simplicity.
I acknowledge the Wikimedia Foundation for his work, his mission and to make available the cover image of this dataset, (please read the article "The Ideal city (painting)") . I acknowledge too StackOverflow and Cross-Validated to be the most important focus of technical knowledge in the world, all the people in Kaggle for the suggestions.
As a beginner in data analisys and modelization (Ok, I passed the exam of statistics in Politecnico di Milano (Italy), but there are more than 10 years that I don't work in this topic and my memory is getting old ^_^) I worked more on data clean, dataset building and building the simplest modelization.
You can use this datase to realize which city is good to live or to expand this to add some other data from Wikipedia (not only reading the tables but too to read the text adn extrapolate the data from the meaningless text.)
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
IT: Proportion of People Living Below 50 Percent Of Median Income: % data was reported at 15.300 % in 2021. This records a decrease from the previous number of 15.600 % for 2020. IT: Proportion of People Living Below 50 Percent Of Median Income: % data is updated yearly, averaging 14.050 % from Dec 1977 (Median) to 2021, with 36 observations. The data reached an all-time high of 16.200 % in 1993 and a record low of 9.700 % in 1982. IT: Proportion of People Living Below 50 Percent Of Median Income: % data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Italy – Table IT.World Bank.WDI: Social: Poverty and Inequality. The percentage of people in the population who live in households whose per capita income or consumption is below half of the median income or consumption per capita. The median is measured at 2017 Purchasing Power Parity (PPP) using the Poverty and Inequality Platform (http://www.pip.worldbank.org). For some countries, medians are not reported due to grouped and/or confidential data. The reference year is the year in which the underlying household survey data was collected. In cases for which the data collection period bridged two calendar years, the first year in which data were collected is reported.;World Bank, Poverty and Inequality Platform. Data are based on primary household survey data obtained from government statistical agencies and World Bank country departments. Data for high-income economies are mostly from the Luxembourg Income Study database. For more information and methodology, please see http://pip.worldbank.org.;;The World Bank’s internationally comparable poverty monitoring database now draws on income or detailed consumption data from more than 2000 household surveys across 169 countries. See the Poverty and Inequality Platform (PIP) for details (www.pip.worldbank.org).
https://www.gesis.org/en/institute/data-usage-termshttps://www.gesis.org/en/institute/data-usage-terms
This survey is a not up-to-date version. Please, use the updated version included in the EVS integrated data files. This national dataset is only available for replication purposes and analysis with additional country-specific variables (see ´Further Remarks´).
Two online overviews offer comprehensive metadata on the EVS datasets and variables.
The extended study description for the EVS 2008 provides country-specific information on the origin and outcomes of the national surveys The variable overview of the four EVS waves 1981 1990 1999/2000 and 2008 allows for identifying country specific deviations in the question wording within and across the EVS waves.
These overviews can be found at: Extended Study Description Variable Overview
Moral, religious, societal, political, work, and family values of Europeans.
Topics: 1. Perceptions of life: importance of work, family, friends and acquaintances, leisure time, politics and religion; frequency of political discussions with friends; happiness; self-assessment of own health; memberships and unpaid work (volunteering) in: social welfare services, religious or church organisations, education, or cultural activities, labour unions, political parties, local political actions, human rights, environmental or peace movement, professional associations, youth work, sports clubs, women´s groups, voluntary associations concerned with health or other groups; tolerance towards minorities (people with a criminal record, of a different race, left/right wing extremists, alcohol addicts, large families, emotionally unstable people, Muslims, immigrants, AIDS sufferers, drug addicts, homosexuals, Jews, gypsies and Christians - social distance); trust in people; estimation of people´s fair and helpful behaviour; internal or external control; satisfaction with life.
Work: reasons for people to live in need; importance of selected aspects of occupational work; employment status; general work satisfaction; freedom of decision-taking in the job; importance of work (work ethics, scale); important aspects of leisure time; attitude towards following instructions at work without criticism (obedience work); give priority to nationals over foreigners as well as men over women in jobs.
Religion: Individual or general clear guidelines for good and evil; religious denomination; current and former religious denomination; current frequency of church attendance and at the age of 12; importance of religious celebration at birth, marriage, and funeral; self-assessment of religiousness; churches give adequate answers to moral questions, problems of family life, spiritual needs and social problems of the country; belief in God, life after death, hell, heaven, sin and re-incarnation; personal God versus spirit or life force; own way of connecting with the divine; interest in the sacred or the supernatural; attitude towards the existence of one true religion; importance of God in one´s life (10-point-scale); experience of comfort and strength from religion and belief; moments of prayer and meditation; frequency of prayers; belief in lucky charms or a talisman (10-point-scale); attitude towards the separation of church and state.
Family and marriage: most important criteria for a successful marriage (scale); attitude towards childcare (a child needs a home with father and mother, a woman has to have children to be fulfilled, marriage is an out-dated institution, woman as a single-parent); attitude towards marriage, children, and traditional family structure (scale); attitude towards traditional understanding of one´s role of man and woman in occupation and family (scale); attitude towards: respect and love for parents, parent´s responsibilities for their children and the responsibility of adult children for their parents when they are in need of long-term care; importance of educational goals; attitude towards abortion.
Politics and society: political interest; political participation; preference for individual freedom or social equality; self-assessment on a left-right continuum (10-point-scale); self-responsibility or governmental provision; free decision of job-taking of the unemployed or no permission to refuse a job; advantage or harmfulness of competition; liberty of firms or governmental control; equal incomes or incentives for indivi...
https://www.futurebeeai.com/policies/ai-data-license-agreementhttps://www.futurebeeai.com/policies/ai-data-license-agreement
The English-Italian Gaming Parallel Corpora is a curated bilingual dataset designed to support game localization, machine translation, and language model training for the Gaming industry. It consists of over 50,000 sentence pairs, professionally translated between English and Italian, capturing the linguistic and cultural depth of gaming content.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The benchmark interest rate in Italy was last recorded at 4.50 percent. This dataset provides - Italy Interest Rate - actual values, historical data, forecast, chart, statistics, economic calendar and news.
https://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_VAR.pdf
https://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdfhttps://catalog.elra.info/static/from_media/metashare/licences/ELRA_END_USER.pdf
The VIT, Venice Italian Treebank is the effort of the collaboration of people working at the Laboratory of Computational Linguistics of the University of Venice in the years 1995-2005. It is partly the result of annotation carried out internally with no specific project in mind and no financial support. This work was partly related to the development of a lexicon, a morphological analyzer, a tagger, a deep parser of Italian. All these resources were finally ready at the beginning of the ‘90s when the LCL got involved in the first national projects. The VIT contains about 272,000 words distributed over six different domains, and this is what makes it so relevant for the study of the structure of Italian language. The following domains were annotated: Domain Number of words Time spanBureaucratic 20,000 1986 Politics 40,000 1984Economic & financial 12,000 1987Literary 10,000 1984Scientific 20,000 1985News 170,000 1994In addition, some 60,000 tokens of spoken dialogues in different Italian varieties were annotated.The annotation follows general X-bar criteria with 29 constituency labels and 102 PoS tags. VIT is also made available in a broad annotation version with 10 constituency labels and 22 PoS tags for machine learning purposes.The format is plain text with square bracketing. However, a UPenn style version which is readable by the open source query language CorpusSearch is also provided.Version 2 also available here: http://catalog.elra.info/en-us/repository/browse/ELRA-W0324/
In 2019, there were approximately 302,020 British citizens living in Spain, with a further 293,061 in Ireland and 176,672 in France. By comparison, there were only 604 British people living in Slovenia, the fewest of any European Union member state. As a member of the European Union, British citizens had the right to live and work in any EU member state. Although these rights were lost for most British citizens after the UK left the EU in 2020, Britons already living in EU states were able to largely retain their previous rights of residence. EU citizens living in the UK EU citizens living in the UK face the same dilemma that British nationals did regarding their legal status after Brexit. In the same year, there were 902,000 Polish citizens, 404,000 Romanians, and 322,000 people from the Republic of Ireland living in the UK in that year, along with almost two million EU citizens from the other 24 EU member states. To retain their rights after Brexit, EU citizens living in the UK were able to apply for the EU settlement scheme. As of 2025, there have been around 8.4 million applications to this scheme, with Romanian and Polish nationals the most common nationality at 1.87 million applications, and 1.27 million applications respectively. Is support for Brexit waning in 2024? As of 2025, the share of people in the UK who think leaving the EU was the wrong decision stood at 56 percent, compared with 31 percent who think it was the correct choice. In general, support for Brexit has declined since April 2021, when 46 percent of people supported Brexit, compared with 43 percent who regretted it. What people think Britain's relationship with the EU should be is, however, still unclear. A survey from November 2023 indicated that just 31 percent thought the UK should rejoin the EU, with a further 11 percent supporting rejoining the single market but not the EU. Only ten percent of respondents were satisfied with the current relationship, while nine percent wished to reduce ties even further.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
This dataset is the result of my study on web-scraping of English Wikipedia in R and my tests on regression and classification modelization in R.
The content is create by reading the appropriate articles in English Wikipedia about Italian cities: I did'nt run NPL analisys but only the table with the data and I ranked every city from 0 to N in every aspect. About the values, 0 means "*the city is not ranked in this aspect*" and N means "*the city is at first place, in descending order of importance, in this aspect* ". If there's no ranking in a particular aspect (for example, the only existence of the airports/harbours with no additional data about the traffic or the size), then 0 means "*no existence*" and N means "*there are N airports/harbours*". The only not-numeric column is the column with the name of the cities in English form, except some exceptions (for example, "*Bra (CN)* " because of simplicity.
I acknowledge the Wikimedia Foundation for his work, his mission and to make available the cover image of this dataset, (please read the article "The Ideal city (painting)") . I acknowledge too StackOverflow and Cross-Validated to be the most important focus of technical knowledge in the world, all the people in Kaggle for the suggestions.
As a beginner in data analisys and modelization (Ok, I passed the exam of statistics in Politecnico di Milano (Italy), but there are more than 10 years that I don't work in this topic and my memory is getting old ^_^) I worked more on data clean, dataset building and building the simplest modelization.
You can use this datase to realize which city is good to live or to expand this to add some other data from Wikipedia (not only reading the tables but too to read the text adn extrapolate the data from the meaningless text.)