TejKshetra: A Dataset for Solar Farms Potential Site Mapping using Suitability parameters in India
Description
This dataset is a structured, high-precision compilation of environmental and solar irradiance data sourced from NASA POWER, covering India's major geographic zones over the span of one year (January to December 2022). It includes seven critical parameters—such as solar radiation, temperature, cloud cover, albedo, and precipitation—essential for evaluating solar power generation potential. The primary goal of the dataset is to aid in the identification of optimal locations for solar energy infrastructure by applying geospatial and machine learning techniques. Carefully preprocessed for consistency and organized for ease of use, this dataset is not only useful for current solar site suitability analysis but also offers long-term value to researchers, urban planners, and policymakers. It supports advanced analytics like clustering, classification, and visualizations, and can serve as a foundation for predictive modeling, transfer learning, and sustainability-oriented decision-making in the field of renewable energy.
Files
Steps to reproduce
Solar dataset comprises an extensive collection of solar irradiance and environmental parameter records, spanning a 10-year period (monthly data), sourced from NASA POWER Data Access Viewer. It focuses on crucial climate and geographic parameters essential for solar energy site suitability analysis, ensuring a comprehensive understanding of solar power potential. This include fetching data over all months of year to give overall idea for solar power plant. To ensure uniformity, a python script was used during pre-processing to combine all data to a standard required for the analysis . In essence, collecting data form source was first step. This data then underwent a quality check and standardization process to become part of final dataset.