16 datasets for Data in Brief
A dataset of voids’ characteristics in multidirectional carbon fiber/epoxy composite laminates, obtained using X-ray micro-computed tomography
Contributors: Mahoor Mehdikhani, Ilya Straumit, Larissa Gorbatikh, Stepan Lomov
... In the current data article, we present detailed characteristics of voids in carbon/epoxy composite laminates as well as the original image stacks, obtained via X-ray micro-Computed Tomography (micro-CT) . Five different lay-ups are produced with altering the recommended cure cycle in order to intentionally induce voids in the material. For each lay-up, an image stack (consisting of tomographic slices) and a dataset are provided. The image slices are in 8-bit TIF format. The datasets (spreadsheets) include the volume, size parameters, shape parameters, orientation, and location of all the detected voids in the specimen. The segmentation of the images and quantification of voids are performed in VoxTex, an in-house software for processing of micro-CT results. The data is linked to a Data in Brief article "A dataset of voids’ characteristics in multidirectional carbon fiber/epoxy composite laminates, obtained using X-ray micro-computed tomography" and linked to the article "Mehdikhani et al. Detailed characterization of voids in multidirectional carbon fiber/epoxy composite laminates using X-ray micro-computed tomography. Comp Part A. in press.".
Data from : Plastic debris dataset on the Seine river banks: plastic pellets, unidentified plastic fragments and plastic sticks are the Top 3 items in a historical accumulation of plastics.
Contributors: Romain Tramoy, Laurent Colasse, Johnny Gasperi, bruno tassin
... Inventory of plastic items collected in a quadrat of 1m2 in a historical polluted shore in the Seine river (downstream of the estuary; Lat. 49.4339; Long. 0.6160). Data are representative of the historical plastic pollution occurring in this river with few items dated from 1965, 1974, 1983, 1992 or 2010. Plastic items were classified according to OSPAR and MSFD classifications, which give insights about the origin of the items and their chemical composition. A total of 20 259 plastic debris were individually counted, classified and weighted by category. Those plastic debris are more than 150% heavier in mass (> 4kg) than organic debris, i.e. dead vegetation and gastropod shells, found in this kind of dry march surrounded by reedbeds. The Top 3 categories of items collected are plastic preproduction pellets, unidentified plastic fragments and plastic sticks (cotton bud and lollipop sticks). Plastic preproduction pellets are 15 times more numerous than gastropod shells. High concentration of pellets could be linked to the vicinity of plastic manufacturers near the sampled site. They represent 50% of the items collected during this campaign but only 5.6% of the mass. In contrast, around 30% of the mass is carried by the unidentified fragments of macroplastics >2.5cm, which only represent 7% of the total items. Hundreds of caps, lids, and rings were also found without their associated bottles, which are often prompt to sink. Furthermore, the dataset refers to specific activities in the estuary with for example 100 g/m2 of polyethylene from shotgun plastic wads related to intense hunting activities. Those items have to be mentioned because they are very common in the estuary and their origin is clearly identified, while alternatives such as biodegradable wads do exist. Reporting number of items, associated mass and volumes will improve conversions of unit for other studies related to river pollution when only one of the units are available. To facilitate conversions, mass per item were also reported for the Top 10 items. In addition, specific items such as plastic tag ties (e.g. textile), or plastic fibers from toilet brushes were unusually reported and should be considered as additional categories in OSPAR/MSFD classifications for rivers.
Contributors: Johan W. Joubert
... The data accompanying this article include the compressed, Extensible Markup Language (XML) files of the synthetic populations for the nine areas of importance in South Africa. The provided populations are controlled at the household level using (household) income, and at individual levels using gender and population group. The result provides a complete stock of individuals while accounting for detailed demographic, socioeconomic information, and household structure. The detailed XML Schema Definition (XSD) and XML Document Type Definition (DTD), which contains the declarations that describes the formal acceptable structure of the XML file, is available on http://www.matsim.org/files/dtd/. More specifically, there is one XSD definition for the household file, households_v1.0.xsd, and one DTD file for the individuals, population_v6.dtd. The files are normal XML and readable using many parsers. Our choice to use the Multi-Agent Transport Simulation (MATSim) infrastructure is because the populations are, in our context, frequently used for large-scale mesoscopic transport models using the agent-based MATSim.
Contributors: Evgeny Kuzmin
... The processed data include the aggregated information on the Russian companies, established and operated in 1991-2015, the survey also optionally covered the 1987-1990 period. Main computational parameters within the empirical survey include the information on a number of newly established companies; a natural loss of newly established companies (cumulatively and annually); a specific corporate survival rate; an adjusted and unadjusted life length.
Contributors: Trevor Williams, Sonia M Leach, Weiguo Feng
... These data are related to the research articles entitled "Spatial and Temporal Analysis of Gene Expression during Growth and Fusion of the Mouse Facial Prominences" by Feng et al, 2009 and “Systems Biology of facial development: contributions of ectoderm and mesenchyme” by Hooper et al. (submitted). Contained here is a new HTML library wherein individual gene expression profiles are presented as an indexed and clickable list of the genes, each linked to a graphical image of that gene's expression profile in the ectoderm, mesenchyme, or intact prominence, and augmented by functional annotations relevant for craniofacial biology. The data represent a valuable visual alternative to the raw data available elsewhere for analysis purposes (The raw data (.cel files) and normalized expression data are available at GEO (www.ncbi.nlm.nih.gov/geo) under accession numbers GSE7759  and GSE62214  and at FaceBase (www.facebase.org) under accession FB00000803). The data presented here as HTML pages will enable investigators to obtain a rapid assessment of the relative expression level of any gene on the array with respect to time, tissue, prominence and expression trajectory. The data are contained in a tar.gz archive, TrevorHTMLs.tar.gz, designed to be uncompressed in a single location, where hyperlinks use relative file path names to navigate the set of files. Within this folder are three items: two folders and an index.html. Opening the top-level index.html file in a web browser will provide information concerning the two data sets. The index provides an overview, using color-coded boxes for each named gene, to illustrate the gene expression profile found within each of the data sets. Hyperlinks are also available to access details for each gene in the Whole Prominence or Ectoderm/Mesenchyme data sets. The two subfolders, WholeProminence and EctoMesen, contain expression profiles and database annotations for every named gene available as .html pages, indexed by gene name for the two studies. Again, within each data subfolder there is a specific index for the whole prominence or ectoderm/mesenchyme dataset . There is also a folder "HTMLS" that connects directly to the list of genes and a folder "JPEGS" which has a list of images used to populate the html pages. Within the whole prominence or ectoderm/mesenchyme datasets, a gene-specific webpage visualizes the expression and detection values for each gene as heatmaps and line graphs (raw and log2 scale). Each gene-specific webpage also lists annotations from the Mammalian Phenotype, Kyoto Encyclopedia of Genes and Genomes (KEGG), InterPro and Gene Ontology (GO) databases. Terms relevant to craniofacial biology are highlighted in red.
Gender specific hippocampal whole genome transcriptome data from mice lacking the Cav3.2 T-type voltage-gated calcium channel
Contributors: Marco Weiergräber
... Hippocampus was isolated from Cav3.2+/+, Cav3.2+/- and Cav3.2-/- mice (Chen et al., 2003) from both genders. Following hippocampal RNA isolation, microarray procedures were carried out to acquire the transcriptome profile of the animals. The uploaded raw data files for each animal start with the sample number. The sample numbers for the individual groups are as follows (see also related "Data in Brief" article): Males: Cav3.2+/+: n = 3 (♂) sample # 19 - 21; Cav3.2+/-: n = 3 (♂) sample # 22 - 24; Cav3.2-/-: n = 3 (♂) sample # 25 - 27. Females: Cav3.2+/+: n = 3 (♀) sample # 28 - 30; Cav3.2+/-: n = 3 (♀) sample # 31 - 33; Cav3.2-/-: n = 3 (♀) sample # 34 - 36.
Gender specific hippocampal whole genome transcriptome data from mice lacking the Cav2.3 R-type voltage-gated calcium channel
Contributors: Marco Weiergräber
... Hippocampus was isolated from Cav2.3+/+, Cav2.3+/- and Cav2.3-/- mice (Wilson et al., 2000) from both genders. Following hippocampal RNA isolation, microarray procedures were carried out to acquire the transcriptome profile of the animals. The uploaded raw data files for each animal start with the sample number. The sample numbers for the individual groups are as follows (see also related "Data in Brief" article): Males: Cav2.3+/+: n = 3 (♂) sample # 1 - 3; Cav2.3+/-: n = 3 (♂) sample # 4 - 6; Cav2.3-/-: n = 3 (♂) sample # 7 - 9. Females: Cav2.3+/+: n = 3 (♀) sample # 10 - 12; Cav2.3+/-: n = 3 (♀) sample # 13 - 15; Cav2.3-/-: n = 3 (♀) sample # 16 - 18.
Contributors: Nabeel Mohammed, Sifat Momen, Anowarul Abedin, Mithun Biswas, Rafiqul Islam, Gautam Shom, Md. Shopon
... This dataset, BanglaLekha-Isolated, is a collection of Bangla handwritten isolated character samples . It contains samples of 50 Bangla basic characters, 10 Bangla numerals and 24 selected compound characters. 2000 handwriting samples for each of the 84 characters were collected, digitized and pre-processed. After discarding mistakes and scribbles, 1,66,105 handwritten character images were included in the final dataset. The dataset also includes information about the age and gender of the subjects from whom the handwriting samples were collected. This information is mapped to each individual image. A separate spreadsheet gives an assessment of the aesthetic quality of the handwriting samples, collected from three independent assessors. This assessment is done on groups of 84 characters and not on individual characters.
Contributors: Elizabeth Holm, Brian DeCost
... This data article presents a data set comprised of 2048 synthetic scanning electron microscope (SEM) images of powder materials and descriptions of the corresponding 3D structures that they represent. These images were created using Blender, an open source computer graphics suite, and the generating scripts are included with the data set. Eight particle size distributions are represented with 256 independent images from each. The particle size distributions are relatively similar to each other, so that the dataset offers a useful benchmark to assess the fidelity of image analysis techniques.
Diffusion tensor microscopy data (15.6um in-plane) of white matter tracts in the human, pig, and rat spinal cord with corresponding tissue histology
Contributors: Jeremy Flint, Brian Hansen, Stephen Blackband
... This data consists of nine diffusion tensor imaging (DTI) datasets acquired with magnetic resonance microscopy (MRM, 15.6 um in-plane) as well as the corresponding histology for each of the nine samples. Cross sections of fixed spinal cord from human (6), rat (2) and pig (1) sources were imaged along the boundary of gray and white matter in the ventral horn. Following MR collections, the samples were stained using Nissl or Black Gold II to label alpha-motor neuron cell bodies and myelin respectively. After staining, a second series of images were obtained using bright-field microscopy techniques. Thus, the corresponding histology images can be used as a reference for the microscopic structures (cell bodies, axon bundles) visualized by the MR microscopy data.