2 datasets found
  1. 1117 Russian cities with city name, region, geographic coordinates and 2020...

    • zenodo.org
    • explore.openaire.eu
    csv
    Updated Aug 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Evgeniy Pogrebnyak; Evgeniy Pogrebnyak; Kirill Artemov; Kirill Artemov (2021). 1117 Russian cities with city name, region, geographic coordinates and 2020 population estimate [Dataset]. http://doi.org/10.5281/zenodo.5151423
    Explore at:
    csvAvailable download formats
    Dataset updated
    Aug 6, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Evgeniy Pogrebnyak; Evgeniy Pogrebnyak; Kirill Artemov; Kirill Artemov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Russia
    Description

    1117 Russian cities with city name, region, geographic coordinates and 2020 population estimate.

    How to use

    from pathlib import Path
    import requests
    import pandas as pd
    
    url = ("https://raw.githubusercontent.com/"
       "epogrebnyak/ru-cities/main/assets/towns.csv")
    
    # save file locally
    p = Path("towns.csv")
    if not p.exists():
      content = requests.get(url).text
      p.write_text(content, encoding="utf-8")
    
    # read as dataframe
    df = pd.read_csv("towns.csv")
    print(df.sample(5))

    Files:

    Сolumns (towns.csv):

    Basic info:

    • city - city name (several cities have alternative names marked in alt_city_names.json)
    • population - city population, thousand people, Rosstat estimate as of 1.1.2020
    • lat,lon - city geographic coordinates

    Region:

    • region_name - subnational region (oblast, republic, krai or AO)
    • region_iso_code - ISO 3166 code, eg RU-VLD
    • federal_district, eg Центральный

    City codes:

    • okato
    • oktmo
    • fias_id
    • kladr_id

    Data sources

    Comments

    City groups

    • Ханты-Мансийский and Ямало-Ненецкий autonomous regions excluded to avoid duplication as parts of Тюменская область.

    • Several notable towns are classified as administrative part of larger cities (Сестрорецк is a municpality at Saint-Petersburg, Щербинка part of Moscow). They are not and not reported in this dataset.

    By individual city

    Alternative city names

    • We suppressed letter "ё" city columns in towns.csv - we have Орел, but not Орёл. This affected:

      • Белоозёрский
      • Королёв
      • Ликино-Дулёво
      • Озёры
      • Щёлково
      • Орёл
    • Дмитриев and Дмитриев-Льговский are the same city.

    assets/alt_city_names.json contains these names.

    Tests

    poetry install
    poetry run python -m pytest
    

    How to replicate dataset

    1. Base dataset

    Run:

    • download data stro rar/get.sh
    • convert Саратовская область.doc to docx
    • run make.py

    Creates:

    • _towns.csv
    • assets/regions.csv

    2. API calls

    Note: do not attempt if you do not have to - this runs a while and loads third-party API access.

    You have the resulting files in repo, so probably does not need to these scripts.

    Run:

    • cd geocoding
    • run coord_dadata.py (needs token)
    • run coord_osm.py

    Creates:

    • coord_dadata.csv
    • coord_osm.csv

    3. Merge data

    Run:

    • run merge.py

    Creates:

    • assets/towns.csv

  2. DATABASE: RUSSIAN LARGE URBAN REGIONS (LUR)

    • zenodo.org
    Updated Nov 25, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mikhail Rogov; Mikhail Rogov (2021). DATABASE: RUSSIAN LARGE URBAN REGIONS (LUR) [Dataset]. http://doi.org/10.5281/zenodo.3354436
    Explore at:
    Dataset updated
    Nov 25, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Mikhail Rogov; Mikhail Rogov
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Russia
    Description

    This database provides a construction of Large Urban Regions (LUR) in Russia. A Large Urban Region (LUR) can be defined as an aggregation of continuous statistical units around a core that are economically dependent on this core and linked to it by economic and social strong interdependences. The main purpose of this delineation is to make cities comparable on the national and world scales and to make comparative social-economic urban studies. Aggregating different municipal districts around a core city, we construct a single large urban region, which allows to include all the area of economic influence of a core into one statistical unit (see Rogov & Rozenblat, 2019 for more details). In doing so we use four principal urban concepts (Pumain et al., 1992): political definition, morphological definition, functional definition and conurbation that we call Large Urban Region. We implemented LURs using criteria such as population distribution, road networks, access to an airport, distance from a core, presence of multinational firms. In this database we provide population data for LURs and their administrative units.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Evgeniy Pogrebnyak; Evgeniy Pogrebnyak; Kirill Artemov; Kirill Artemov (2021). 1117 Russian cities with city name, region, geographic coordinates and 2020 population estimate [Dataset]. http://doi.org/10.5281/zenodo.5151423
Organization logo

1117 Russian cities with city name, region, geographic coordinates and 2020 population estimate

Explore at:
csvAvailable download formats
Dataset updated
Aug 6, 2021
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Evgeniy Pogrebnyak; Evgeniy Pogrebnyak; Kirill Artemov; Kirill Artemov
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered
Russia
Description

1117 Russian cities with city name, region, geographic coordinates and 2020 population estimate.

How to use

from pathlib import Path
import requests
import pandas as pd

url = ("https://raw.githubusercontent.com/"
   "epogrebnyak/ru-cities/main/assets/towns.csv")

# save file locally
p = Path("towns.csv")
if not p.exists():
  content = requests.get(url).text
  p.write_text(content, encoding="utf-8")

# read as dataframe
df = pd.read_csv("towns.csv")
print(df.sample(5))

Files:

Сolumns (towns.csv):

Basic info:

  • city - city name (several cities have alternative names marked in alt_city_names.json)
  • population - city population, thousand people, Rosstat estimate as of 1.1.2020
  • lat,lon - city geographic coordinates

Region:

  • region_name - subnational region (oblast, republic, krai or AO)
  • region_iso_code - ISO 3166 code, eg RU-VLD
  • federal_district, eg Центральный

City codes:

  • okato
  • oktmo
  • fias_id
  • kladr_id

Data sources

Comments

City groups

  • Ханты-Мансийский and Ямало-Ненецкий autonomous regions excluded to avoid duplication as parts of Тюменская область.

  • Several notable towns are classified as administrative part of larger cities (Сестрорецк is a municpality at Saint-Petersburg, Щербинка part of Moscow). They are not and not reported in this dataset.

By individual city

Alternative city names

  • We suppressed letter "ё" city columns in towns.csv - we have Орел, but not Орёл. This affected:

    • Белоозёрский
    • Королёв
    • Ликино-Дулёво
    • Озёры
    • Щёлково
    • Орёл
  • Дмитриев and Дмитриев-Льговский are the same city.

assets/alt_city_names.json contains these names.

Tests

poetry install
poetry run python -m pytest

How to replicate dataset

1. Base dataset

Run:

  • download data stro rar/get.sh
  • convert Саратовская область.doc to docx
  • run make.py

Creates:

  • _towns.csv
  • assets/regions.csv

2. API calls

Note: do not attempt if you do not have to - this runs a while and loads third-party API access.

You have the resulting files in repo, so probably does not need to these scripts.

Run:

  • cd geocoding
  • run coord_dadata.py (needs token)
  • run coord_osm.py

Creates:

  • coord_dadata.csv
  • coord_osm.csv

3. Merge data

Run:

  • run merge.py

Creates:

  • assets/towns.csv

Search
Clear search
Close search
Google apps
Main menu