Abstract [Related Publication]:
Models of dispersal potential are required to predict connectivity between populations of sessile organisms. However, to date, such models do not allow for time‐varying rates of acquisition and loss of competence to settle and metamorphose, and permit only a limited range of possible survivorship curves. We collect high‐resolution observations of coral larval survival and metamorphosis, and apply a piecewise modeling approach that incorporates a broad range of temporally‐varying rates of mortality and loss of competence. Our analysis identified marked changes in competence loss and mortality rates, whose timing implicates developmental failure and depletion of energy reserves. Asymmetric demographic rates suggest more intermediate‐range dispersal, less local retention, and less long‐distance dispersal than predicted by previously‐employed non‐piecewise models. Because vital rates are likely temporally asymmetric, at least for non‐feeding broadcast‐spawned larvae, piecewise analysis of demographic rates will likely yield more reliable predictions of dispersal potential.
Usage Notes [Dryad]:
TenuisGBR2012longtermsurvival.csv
Empirical long term survival data for A. Tenuis larvae. "temp" is the temperature that the larvae were raised at, "rep" is the replicate number, "day" is the date the observation was taken, "age (h)" is the larval age (h) at the time of the observation, "age (d)" is the larval age (d) at the time of the observation, "larvae" is the number of larvae at the beginning of the experiment, "surv" is the number of larvae alive at the time of the observation.
A.tenuisGBR2012metamorphosis.csv
Empirical metamorphosis data for A. Tenuis larvae. "temp" is the temperature that the larvae were raised at, "date" is the date the observation was taken, "hour" is the hour of the day that the observation was taken, "age (h)" is the larval age (h) at the time of the observation, "age (d)" is the larval age (d) at the time of the observation, "rep" is the replicate number, "larvae" is the number of larvae at the beginning of the experiment, "meta" is the number of larvae that had metamorphosed by the time of the observation, "swimming" is the number of larvae swimming at the time of the observation.
Survival model fit R code
This code fits the best-fitting survival model to the data.
(survival model code.R)
Competence model fit R code
This code fits the best-fitting competence model to the data.
(competence model code.R)
The dataset also includes 2 README files (MS Word format) for the R codes.
This repository contains datasets relating to coronavirus in Sierra Leone, as well as on demographic and other information from the 2015 Population and Household Census (PHC). It also includes mapping shapefiles by district, so that you can map the district-level coronavirus statistics.
See here for a full description of how the data files have been created from the source data, including the R code.
Last updated: 10 June 2020.
The novel 2019 coronavirus (covid-19) arrived late to West Africa and Sierra Leone in particular. This dataset provides the number of reported cases on a district-by-district basis for Sierra Leone, as well as various additional statistics at the country level. In addition, I provide district-by-district data on demographics and households' main sources of information, both from the 2015 census. For convenience, I also provide shapefiles for mapping the 14 districts of Sierra Leone.
The dataset consists of four main files, which are in the output
folder. See the column descriptions below for further details.
Coronavirus confirmed cases by district (sl_districts_coronavirus.csv
). I found the original data by looking in the static/js/data
folder in the source code for covid19.mic.gov.sl, last accessed 10 June 2020. The file contains the cumulative number of confirmed coronavirus cases in the 14 districts of Sierra Leone as a time series. I have used the R tidyverse to reshape the data and ensure naming is consistent with the other data files.
Demographic statistics by district (sl_districts_demographics.csv
). Data from the 2015 Population and Housing Census (PHC), sourced from Open Data Sierra Leone. The dataset covers the 14 districts of Sierra Leone, which increased to 16 in 2017. Last accessed 10 June 2020.
Main Sources of Information by district (sl_districts_info_sources.csv
). Data from the 2015 Population and Housing Census (PHC), sourced from Open Data Sierra Leone. The dataset presents the main sources of information, such as television or radio, for households in the 14 districts of Sierra Leone. Last accessed 2 June 2020. I note that I have made one correction to the source data (see R code with correction here).
Country-wide coronavirus statistics for Sierra Leone (sl_national_coronavirus.csv
). The original data also comes from covid19.mic.gov.sl, last accessed 10 June 2020. The file contains numerous statistics as time series, listed in the Column Description section below. I note that there are various potential issues in the file which I leave the user to decide how to deal with (duplicate datetimes, inconsistent statistics).
Additionally I include a set of five files with district-by-district mapping (shapefiles) and other data, unchanged from their original source. Each file is labelled in the following way: sl_districts_mapping.*
. These files come from Direct Relief Open Data on ArcGIS Hub. The data also include district-level data on maternal child health attributes, which was the original context of the mapping data.
Coronavirus confirmed cases by district sl_districts_coronavirus.csv
:
date
: Date of reportingdistrict
: District of Sierra Leone (based on pre-2017 administrative boundaries)confirmed_cases
: Cumulative number of confirmed coronavirus cases; NA if no data reporteddecrease
: Dummy variable indicating whether the number of reported cases has been revised down. NA if no reported cases on that date; 1 if there is a decrease from the last reported cases; 0 otherwiseDemographic statistics by district sl_districts_demographics.csv
:
district
: District of Sierra Leone (based on pre-2017 administrative boundaries)d_code
: District coded_id
: District idtotal_pop
: Total population in districtpop_share
: District's share of total country populationt_male
: Total male populationt_female
: Total female populations_ratio
: (*) Sex ratio at birth (number of males for every 100 females, under the age of 1)t_urban
: Total urban populationt_rural
: Total rural populationprop_urban
: Proportion urbant_h_pop
: Sum of h_male
and h_female
h_male
: (?)h_female
: (?)t_i_pop
: Sum of i_male
and i_female
i_male
: (?)i_female
: (?)working_pop
: Working populationdepend_pop
: Dependent population...
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
import h5py
import numpy as np
index = 0 # adjust face index
datafile = h5py.File('dataset.hdf5', 'r')
image = datafile['dataset_0'][index][:256*256]
threshold = 20 # in dB
# scale the amplitude image logarithmically
image[math.isnan(image)] = 0
image = 20 * np.log10(image)
max = np.max(image)
min = max - threshold
image = (image - min) / (max - min)
image[image < 0] = 0
image.reshape((256,256))
# the depth image is between 0.22 m and 0.58 m
image_depth = datafile['dataset_0'][index][-256*256:]
image_depth = image_depth.reshape((256,256))
image_depth[image == 0] = 0.58 # ignore pixels that are ignored in the amlitude image
image_depth = np.nan_to_num(image_depth, nan=0.58)
image_depth[image_depth == 0] = 0.58
image_depth = (image_depth - 0.22) / (0.58-0.22)
# load other data (set start_index and end_index according to the data that shall be loaded)
data = datafile['dataset_0'][index][start_index:end_index]
We would like to thank the Rohde & Schwarz GmbH & Co. KG (Munich, Germany) for providing the radar imaging devices and technical support that made this study possible.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
NDVI Data Set (1. NDVI.nc)
Meteorological Data Set (2.Temperature.nc, ... , 6.Cloud_cover.nc)
Pre-processing code (Set_data_1~3)
Analysis code (code1~5)
Acknowledgments
This work was also supported by Global - Learning & Academic research institution for Master’s·PhD students, and Postdocs (LAMP) Program of the National Research Foundation of Korea (NRF) grant funded by the Ministry of Education (No. RS-2023-00301914).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains mitosis detections, mitotic network structures, and social network analysis (SNA) measures derived from 11,161 diagnostic slides in The Cancer Genome Atlas (TCGA). Mitoses were automatically identified using the MDFS algorithm [1], and each detected mitosis was converted into a node within a mitotic network. The resulting graphs are provided in JSON format, with each file representing a single diagnostic slide.
Each JSON file contains four primary fields:
edge_index
Two parallel lists representing edges between nodes. The ii-th element in the first list corresponds to the source node index, and the ii-th element in the second list is the target node index.
coordinates
A list of [x, y] positions for each node (mitosis). The (x,y) coordinates can be used for spatial visualization or further spatial analyses.
feats
A list of feature vectors, with each row corresponding to a node. These features include:
feat_names
The names of the features in feats
. The order matches the columns in each node’s feature vector.
{
"edge_index": [[1, 2, 6, 10], [2, 4, 8, 11]],
"coordinates": [[27689.0, 12005.0], [24517.0, 17809.0], ...],
"feats": [[1.0, 0.0, 0.0, 0.0], [1.0, 1.0, 0.0, 0.115], ...],
"feat_names": ["type", "Node_Degree", "Clustering_Coeff", "Harmonic_Cen"]
}
Below is a sample Python snippet to load one JSON file, extract node coordinates and the type
feature, and combine them into a single NumPy array:
import json
import numpy as np
# Path to your JSON file
json_file_path = "example_graph.json"
with open(json_file_path, 'r') as f:
data = json.load(f)
# Convert coordinates to NumPy
coordinates = np.array(data["coordinates"])
# Identify the "type" column
feat_names = data["feat_names"]
type_index = feat_names.index("type")
# Extract features and isolate the "type" column
feats = np.array(data["feats"])
node_types = feats[:, type_index].reshape(-1, 1)
# Combine x, y, and type into a single array (N x 3)
combined_data = np.hstack([coordinates, node_types])
print(combined_data)
To visualize or analyze the network structure, you can construct a NetworkX graph as follows:
import json
import networkx as nx
import matplotlib.pyplot as plt
json_file_path = "example_graph.json"
with open(json_file_path, "r") as f:
data = json.load(f)
# Create a NetworkX Graph
G = nx.Graph()
# Add each node with position attributes
for i, (x, y) in enumerate(data["coordinates"]):
G.add_node(i, pos=(x, y))
# Add edges using the parallel lists in edge_index
# (Adjust for 1-based indexing if necessary)
for src, dst in zip(data["edge_index"][0], data["edge_index"][1]):
G.add_edge(src, dst)
Having TIAToolbox installed, one can easily visualize the mitotic network on their respective whole slide images using the following command:
tiatoolbox visualize --slides path/to/slides --overlays path/to/overlays
The only thing to consider is that slides and overlays (provided graph json files) should have the same name. For more information, please refer to Visualization Interface Usage - TIA Toolbox 1.5.1 Documentation.
In case of using this dataset, please cite the following publication:
@article{jahanifar2024mitosis, title={Mitosis detection, fast and slow: robust and efficient detection of mitotic figures}, author={Jahanifar, Mostafa and Shephard, Adam and Zamanitajeddin, Neda and Graham, Simon and Raza, Shan E Ahmed and Minhas, Fayyaz and Rajpoot, Nasir}, journal={Medical Image Analysis}, volume={94}, pages={103132}, year={2024}, publisher={Elsevier} }
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Abstract [Related Publication]:
Models of dispersal potential are required to predict connectivity between populations of sessile organisms. However, to date, such models do not allow for time‐varying rates of acquisition and loss of competence to settle and metamorphose, and permit only a limited range of possible survivorship curves. We collect high‐resolution observations of coral larval survival and metamorphosis, and apply a piecewise modeling approach that incorporates a broad range of temporally‐varying rates of mortality and loss of competence. Our analysis identified marked changes in competence loss and mortality rates, whose timing implicates developmental failure and depletion of energy reserves. Asymmetric demographic rates suggest more intermediate‐range dispersal, less local retention, and less long‐distance dispersal than predicted by previously‐employed non‐piecewise models. Because vital rates are likely temporally asymmetric, at least for non‐feeding broadcast‐spawned larvae, piecewise analysis of demographic rates will likely yield more reliable predictions of dispersal potential.
Usage Notes [Dryad]:
TenuisGBR2012longtermsurvival.csv
Empirical long term survival data for A. Tenuis larvae. "temp" is the temperature that the larvae were raised at, "rep" is the replicate number, "day" is the date the observation was taken, "age (h)" is the larval age (h) at the time of the observation, "age (d)" is the larval age (d) at the time of the observation, "larvae" is the number of larvae at the beginning of the experiment, "surv" is the number of larvae alive at the time of the observation.
A.tenuisGBR2012metamorphosis.csv
Empirical metamorphosis data for A. Tenuis larvae. "temp" is the temperature that the larvae were raised at, "date" is the date the observation was taken, "hour" is the hour of the day that the observation was taken, "age (h)" is the larval age (h) at the time of the observation, "age (d)" is the larval age (d) at the time of the observation, "rep" is the replicate number, "larvae" is the number of larvae at the beginning of the experiment, "meta" is the number of larvae that had metamorphosed by the time of the observation, "swimming" is the number of larvae swimming at the time of the observation.
Survival model fit R code
This code fits the best-fitting survival model to the data.
(survival model code.R)
Competence model fit R code
This code fits the best-fitting competence model to the data.
(competence model code.R)
The dataset also includes 2 README files (MS Word format) for the R codes.