最新刊期

    REN Hongyan, HUANG Kun, ZHAXI

    DOI:10.11834/jrs.20265503
    img
    摘要:Objective The application of drone imagery combined with deep learning technology has become a crucial approach for identifying Himalayan marmots and monitoring their activity patterns at large spatial scales. Within this workflow, the accurate identification of marmot holes is essential because holes provide a stable spatial proxy for the presence and density of animals. However, current methods are still constrained in practice. Owing to uneven terrain, stones around or above burrow entrances, and soil mounds and other ground objects that partially or completely obstruct openings, a considerable proportion of marmot holes are missed during detection. Consequently, the accuracy and robustness of existing burrow-identification techniques remain insufficient for reliable use in plague-related surveillance. Method This study focuses on several typical natural plague foci in the Xizang region. We carried out detailed ground surveys of marmot holes, recording entrance locations, dimensions and associated habitat characteristics. In parallel, UAV remote-sensing campaigns were conducted to obtain high-resolution aerial imagery over the same sites. Based on these observations, we constructed a UAV image dataset that incorporates both marmot burrow entrances and surrounding habitat elements such as stones patches and soil mounds formed by excavation. On this basis, we designed a data-annotation strategy that explicitly integrates habitat information into the labeled objects. Five representative object detection models—YOLOv8, YOLO11, YOLO13, RT-DETR and RF-DETR—were then trained and evaluated under two settings: a traditional annotation method using only the visible burrow entrance, and the proposed habitat-integrated annotation strategy. By systematically comparing results from these two strategies, we assessed the effectiveness of incorporating habitat information into the annotation process. Result Under the traditional annotation method, the accuracy rates for detecting marmot holes achieved by the five models were 93.0% (YOLOv8), 91.6% (YOLO11), 85.5% (YOLO13), 76.5% (RT-DETR) and 83.0% (RF-DETR). When the habitat-integrated annotation strategy was adopted, the performance of most models improved. All models showed higher detection accuracy except for YOLOv8, whose accuracy decreased slightly by 0.2 percentage points. The accuracy gains were 2.2 percentage points for YOLO11, 9.0 percentage points for YOLO13, 16.7 percentage points for RT-DETR and 6.5 percentage points for RF-DETR. The combination of RT-DETR with the habitat-integrated strategy produced the largest improvement, with an increase of about 17%, while the system integrating YOLO11 with the habitat-integrated strategy achieved the best overall performance. Conclusion These results show that an annotation strategy integrating habitat information can effectively reduce the risk of missing marmot holes caused by stones cover, soil mounds and other forms of occlusion that limit traditional annotation methods. By providing richer contextual cues around the entrances, this strategy improves the accuracy and reliability of drone-based identification of Himalayan marmot holes. The proposed annotation framework enhances UAV-based burrow detection, enriches the application of unmanned aerial vehicles for investigating plague focuses associated with Himalayan marmots and has important implications for monitoring and controlling marmot-borne plague across the Qinghai–Tibet Plateau.  
    关键词:unmanned aerial vehicles;deep learning;data annotation strategy;habitat information;Himalayan marmot holes;plague surveillance   
    15
    |
    7
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 158427552 false
    更新时间:2026-06-16

    LI Xiaokui, YANG Shuwen, LI Ziyuan, WANG Wenju, ZHU Hao, XUE Yiming

    DOI:10.11834/jrs.20265468
    img
    摘要:In complex urban environments, dense buildings and significant height variations amplify nonlinear radiometric differences and geometric distortions among multimodal remote sensing images. These challenges severely hinder high-precision registration of high-resolution imagery. Existing registration approaches predominantly rely on two-dimensional features such as texture, often neglecting the valuable three-dimensional spatial information inherent in such scenes. To address these limitations, this study proposes a novel multimodal high-resolution remote sensing image registration method that integrates modality transformation with three-dimensional spatial relationship constraints, aiming to improve robustness and accuracy in complex urban scenarios. The proposed method consists of three main components. First, a cross-modal image translation technique is employed to reduce radiometric discrepancies between multimodal images, effectively narrowing the modality gap and facilitating subsequent feature extraction. Second, monocular depth estimation is introduced to efficiently generate depth maps from single images. These depth maps provide essential spatial priors and serve as a foundation for constructing more informative feature descriptors and enforcing spatial constraints. Finally, a matching strategy based on depth-guided 3D spatial relationship constraints is developed. This strategy includes multi-feature map keypoint detection to capture potential salient features, the construction of depth-enhanced joint descriptors to improve feature distinctiveness, and the incorporation of 3D spatial relationship constraints to ensure geometrically consistent matching. Together, these steps enable reliable detection, robust description, and accurate matching of feature points across multimodal images. The proposed method was compared with several traditional and deep learning methods on four representative multimodal remote sensing datasets. Experimental results show that the proposed method achieves an average Number of Correct Matches (NCM) of 510, which improves upon existing methods by a factor of 0.92 to 5.22. The average Root Mean Square Error (RMSE) is 1.58 pixels, and the registration accuracy is improved by a factor of 4.12 to 4.78 compared to state‑of‑the‑art methods. These results demonstrate that the proposed method has clear advantages in registration accuracy and overall performance, effectively suppressing nonlinear radiometric differences and geometric distortions in urban multimodal images and achieving high‑precision registration. This study presents a robust solution for multimodal high-resolution remote sensing image registration in complex urban environments. By combining cross-modal translation, monocular depth estimation, and depth-based three-dimensional spatial relationship constraints, the proposed method successfully addresses both nonlinear radiometric differences and geometric distortions. The integration of 3D spatial information significantly improves feature matching robustness and registration precision compared to conventional 2D-based approaches. Experimental validation confirms that the method achieves superior performance in terms of both accuracy and stability, establishing a solid foundation for downstream applications such as urban mapping, change detection, and multi-sensor data fusion.  
    关键词:image registration;multimodal imagery;3D spatial relationships;complex urban scenes;depth maps   
    35
    |
    17
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 158292679 false
    更新时间:2026-06-12

    CUI Ying, CHEN Yunhao, GENG Hao, LI Kangning, LI Xiaohui

    DOI:10.11834/jrs.20266070
    img
    摘要:Vegetation phenology reflects the integrated responses of ecosystems to climatic conditions, land surface modifications, and anthropogenic disturbances, and serves as a critical indicator for assessing ecosystem feedbacks to urbanization and climate change. Rapid urban expansion has substantially altered surface thermal environments, vegetation structure, and ecological processes, leading to differentiated phenological responses among urban cores, towns, and surrounding rural areas. Although previous studies have documented urban–rural phenological contrasts, most analyses have relied on moderate-resolution remote sensing products, which are subject to mixed-pixel effects and limited capacity to resolve fine-scale urban heterogeneity. Systematic investigations based on high spatial resolution data across large spatial extents remain insufficient. This study aims to (1) extract high-resolution vegetation phenology metrics for major Chinese cities using Sentinel-2 imagery, (2) quantify phenological differences along the urban–town–rural gradient, and (3) examine how these differences vary across vegetation types, climatic zones, and city-size categories. This study aims to (1) extract high-resolution vegetation phenology metrics for major Chinese cities using Sentinel-2 imagery, (2) quantify phenological differences along the urban–town–rural gradient, and (3) examine how these differences vary across vegetation types, climatic zones, and city-size categories. Using Sentinel-2 imagery from the Copernicus program (2019-2024), Enhanced Vegetation Index (EVI) time series were constructed for 128 cities across China and their adjacent town and rural areas. A dynamic threshold approach was applied to extract the Start of Season (SOS) and End of Season (EOS) from annual EVI trajectories. To ensure biological plausibility, phenological metrics were constrained within reasonable day-of-year ranges. The derived phenological dates were validated against ground-based observations and compared with the moderate-resolution MCD12Q2 phenology product to assess absolute accuracy using MAE and RMSE metrics. Urban–town–rural differences were quantified for each city and subsequently analyzed across vegetation types, climatic zones, and city-size classes. Statistical comparisons and regression analyses were employed to evaluate spatial heterogeneity and scaling patterns. Validation results indicate that Sentinel-2-derived phenology exhibits substantially lower MAE and RMSE values than the MCD12Q2 product, demonstrating improved absolute accuracy under heterogeneous urban landscapes. Nationally, urban vegetation shows a consistent spring advancement and autumn delay relative to surrounding areas. On average, urban SOS occurs 1.26 days earlier than in towns and 1.49 days earlier than in rural areas, while urban EOS is delayed by 1.51 days and 1.25 days relative to towns and rural areas, respectively, indicating an extended growing season in urban environments. Phenological responses vary significantly among vegetation types. Forest ecosystems exhibit the strongest sensitivity to urbanization, showing the largest magnitude of spring advancement and autumn delay. In contrast, other vegetation types display comparatively moderate responses. Climatic background further modulates urbanization effects. The temperate climate zone shows the most pronounced urban-rural phenological contrasts, whereas subtropical and tropical zones exhibit weaker and less stable differences. City size also influences phenological patterns. The advancement of urban SOS generally intensifies with increasing city size, suggesting a scaling effect associated with enhanced urban heat island intensity. However, the delay of EOS is more evident in small and medium sized cities and may weaken or even reverse in megacities, possibly due to complex interactions among thermal stress, vegetation management, and land surface heterogeneity. Overall, this study provides high-resolution, large-scale evidence of differentiated vegetation phenological responses to urbanization in China. Urban expansion systematically modifies growing season dynamics, characterized by earlier spring onset and delayed autumn senescence, although the magnitude and direction of these effects depend on vegetation type, climatic background, and city size. By leveraging Sentinel-2 imagery and a dynamic threshold extraction framework, this research improves the quantitative reliability of urban phenology assessment compared with conventional moderate-resolution products. The findings enhance understanding of how urbanization reshapes ecosystem seasonal dynamics and contribute to clarifying the socio-ecological implications of phenological shifts. These results provide scientific support for sustainable urban planning, ecological infrastructure optimization, and climate adaptation strategies under continued urban expansion.  
    关键词:remote sensing;urbanization;vegetation phenology;sentinel-2;dynamic threshold method;gradient difference   
    26
    |
    17
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 158241399 false
    更新时间:2026-06-12

    YANG Zhen, YAO Zongqi, ZHANG Xiaoli

    DOI:10.11834/jrs.20266167
    img
    摘要:Objective Accurate monitoring of forest resources at the individual tree level is fundamental for forest ecosystem management. Unmanned aerial vehicle (UAV) visible light (RGB) imagery provides a cost-effective and high-spatial-resolution data source for these wide-area monitoring tasks. High-spatial-resolution imagery comprehensively records the fine contours of trees and the background habitat of the forest. Utilizing panoptic segmentation technology for unified interpretation enables the synchronous extraction of all forest elements. Nevertheless, interpreting highly closed-canopy forest scenes remains a critical challenge. Traditional deep learning approaches often decouple semantic segmentation for background elements and instance segmentation for individual trees, leading to severe pixel-level classification conflicts and spatial topology inconsistencies. Furthermore, the limited spectral information in RGB imagery frequently causes severe spectral confusion among adjacent trees. To systematically address these challenges, this study proposes an end-to-end forest panoptic segmentation model named FSC-Mask2Former.Method The proposed FSC-Mask2Former builds upon the Mask2Former baseline by introducing two core architectural improvements tailored to the unstructured features of forests. First, a Frequency-domain Texture Awareness (FTA) module is incorporated into the feature extraction pathway to compensate for the loss of micro-texture details caused by spatial downsampling, essentially functioning as a learnable high-pass filter in the feature space to retain critical edge gradients. Second, an Instance-aware Query Contrastive (IQC) head is integrated at the output of the Transformer decoder to maximize the inter-class feature distance between spectrally similar tree species, imposing an anisotropic constraint on the feature distribution to enlarge decision boundaries and fundamentally suppress category assignment conflicts. To evaluate the model, a densely annotated dataset was constructed using UAV RGB imagery from Gaofeng Forest Farm in the Guangxi Zhuang Autonomous Region, supplemented by data from Genhe City in the Inner Mongolia Autonomous Region, Jixi County in Anhui Province, and Hengzhou City in the Guangxi Zhuang Autonomous Region to validate model transferability.Result Comprehensive experiments demonstrate that FSC-Mask2Former significantly outperforms existing mainstream networks. The model achieves an overall Panoptic Quality (PQ) of 57.0%, a substantial gain of 11.0 percentage points over the baseline. Most notably, the foreground Recognition Quality (RQ) reaches 56.0%, representing a 12.0 percentage point increase. Visualizations confirm that FSC-Mask2Former effectively separates touching instances in high-canopy-closure forest areas, precisely delineates boundaries for morphologically irregular canopies, and maintains the spatial coherence of background elements. Furthermore, multi-region experiments indicate robust generalization capabilities across different geographical and ecological conditions.Conclusion The proposed FSC-Mask2Former successfully overcomes the bottlenecks of spectral homogeneity and task separation in UAV-based forest interpretation. This research proves that accurate full-element forest mapping can be realized using universally accessible UAV RGB imagery, providing a practical, robust, and highly cost-effective technical paradigm for modern forest resource monitoring.  
    关键词:UAV remote sensing;Panoptic segmentation;Mask2Former;Frequency domain analysis;contrastive learning;Individual tree recognition   
    29
    |
    11
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 158241355 false
    更新时间:2026-06-12

    TAO Huimin, QIN Zhengkun, HAN Yang, HU Juyang, BI Yanmeng

    DOI:10.11834/jrs.20265518
    img
    摘要:Objective Leveraging the advantage of high-frequency observations, geostationary satellites effectively compensate for the limitations of polar-orbiting meteorological satellites in terms of long revisit cycles and insufficient timeliness in monitoring short-lived severe weather. Compared to infrared bands, microwave radiation is less affected by clouds and exhibits excellent penetration through non-precipitating clouds. Consequently, deploying microwave sounding instruments on geostationary orbits has become a crucial development direction in meteorological observation. The Fengyun-4 Geostationary Microwave Sounder (GeoMWS), currently under development in China, aims to achieve all-weather, all-day, and high-frequency three-dimensional observations of cloud and precipitation systems and their internal structures. This capability is of great significance for monitoring rapidly evolving weather systems and improving Numerical Weather Prediction (NWP). However, a prerequisite for satellite data assimilation (whether clear-sky or all-sky) is the accurate identification of clear-sky and cloudy pixels, along with the assignment of appropriate observation errors. Given that existing cloud detection algorithms are predominantly designed for polar-orbiting satellites and are difficult to directly adapt to the characteristics of geostationary microwave observations, this study aims to develop a specialized cloud detection method for GeoMWS to facilitate the effective assimilation of its data.Method This study proposes a fast and efficient cloud detection algorithm specifically tailored for the GeoMWS instrument. The method constructs a cloud detection index based on two window channels (Channel 2 and Channel 4). It fully accounts for the sensitivity differences of these channels across various latitudinal regions and introduces brightness temperature normalization to eliminate background noise caused by temperature variations. Furthermore, the optimal threshold for the cloud detection index is scientifically determined by analyzing the evolution characteristics of the index under different classification criteria.Result Evaluation results demonstrate that the proposed method achieves robust detection performance across different time periods. The Probability of Detection for clouds (POD_cld) and the Hit Rate (HR) both exceed 75%, indicating a high capability in identifying cloudy scenes. Meanwhile, the False Alarm Rate for clouds (FAR_cld) is effectively controlled below 20%. Compared to previous methods applied to this context (which maintain a detection rate of approximately 60%), the proposed method exhibits significant advantages in accuracy.Conclusion This study successfully develops a cloud detection algorithm optimized for GeoMWS. This method not only provides high-precision cloud detection information for data assimilation under both clear-sky and cloudy conditions but also offers reliable theoretical and methodological support for future operational applications. Particularly in critical areas such as typhoon monitoring and NWP data assimilation, it will significantly enhance the application effectiveness of geostationary microwave satellite data.  
    关键词:geostationary orbit;microwave sounding;geostationary microwave sounding;cloud detection over ocean;simulated data   
    20
    |
    11
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 158241281 false
    更新时间:2026-06-12

    Zhou Tianhao, Wang Yexin, Dell'Agnello Simone, Di Kaichang, Chen Shaohua, Salvatori Lorenzo, Guo Guangyan, Tibuzzi Mattia, Chen Qianglong, Rodriquez Raffaele, Campagnola Roberto, Lauretani Rudi, Zhou Yasong, Villalba Blanca, Chen Tianhao

    DOI:10.11834/jrs.20265539
    img
    摘要:The micro laser retroreflector array INRRI (INstrument for Landing-Roving Laser Retroreflector Investigations) is one of the international payloads onboard the Chang’e-6 mission. It receives laser signals emitted by the laser equipment onboard lunar orbiters and utilizes the characteristic of parallel reflection of incident laser on the reflective surface to achieve high-precision ranging. Through repeated observations and calculations, INRRI has become the very first absolute control point on the far side of the Moon, providing fundamental support for lunar geodesy, remote sensing mapping and positioning, high-precision orbit determination and navigation of lunar orbiters. This study presents mechanical structure and optical design of INRRI, and analyzes its key performance. In terms of design, a spherical dome structure and coating technology are adopted to enable INRRI to effectively receive the orbiter’s laser beam within a wider field of view. In terms of performance analysis, an effective reflection area model is established to analyze the reflecting signal intensity under different incident angles. The results show that the effective reflection area of a single corner cube reflector (CCR) reaches approximately 1 cm² under normal incidence, and INRRI can maintain stable effective reflection performance within a 60° half-aperture field of view. Furthermore, combined with the velocity aberration effect and far-field diffraction theory, the detectability of INRRI by the laser equipment onboard lunar orbiters is comprehensively evaluated. The dihedral angles of the CCR measured by a ZYGO interferometer are (0.37″, 0.64″, 0.37″), corresponding to a total beam deviation angle of approximately 3.01″. By establishing a far-field diffraction optical path and conducting simulation analysis, it is found that the far-field diffraction pattern exhibits spots with varying intensities distributed over an angular range of 40 μrad, thereby covering the calculated maximum velocity aberration offset angle of 11.01 μrad and satisfying the requirements of orbital observation and compensation. The observability of INRRI has been further confirmed by multiple successful detections using the Lunar Orbiter Laser Altimeter (LOLA) onboard the Lunar Reconnaissance Orbiter (LRO). This demonstrates the feasibility of employing micro laser retroreflector arrays as absolute control points on the lunar surface and provides a reference for the design optimization and future application of micro laser retroreflectors in subsequent deep-space exploration missions.  
    关键词:Laser retroreflector;laser ranging;effective reflection area;velocity aberration;far-field diffraction   
    96
    |
    17
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 157056440 false
    更新时间:2026-05-21

    DENG Zhigang, ZHAO Hongmei, ZENG Qingxuan, WANG Chenwei, PAN Pingping

    DOI:10.11834/jrs.20265424
    img
    摘要:Objective Hyperspectral reconstruction (HSR) from multi-spectral information is an ill-posed inverse problem. At present, hyperspectral reconstruction mainly focuses on the visible and near infrared spectral bands. And there is almost no research on thermal infrared HSR, due to the absence of hyperspectral thermal infrared remote sensing data, which limits the development of thermal infrared remote sensing. Simultaneously, previous studies often utilize existing satellite multispectral band and focus on the study of HSR models, while the issues of multispectral partitioning and band selection for HSR is seldom focused. We aim to explore the ideal thermal infrared multispectral partitioning and band selection method with a certain HSR model to improve the hyperspectral reconstruction precision of thermal infrared.Method We use in-situ measured 727 hyperspectral thermal infrared emissivity samples of seven land surface cover materials,inc. asphalt roads, marble, gray ground tiles, painted surfaces, green ground tiles, slate paths, and brick-concrete pavements and so on, in this study. The broadband thermal infrared emissivity is calculated from hyperspectral thermal infrared emissivity using energy conservation law,after the broadband range is determined by the following proposed method. We proposed several broadband partitioning methods, such as,traditional equal wavelength interval method, cluster analysis method based on the correlation coefficient between temperature and emissivity and optimization method based on Quantum Genetic Algorithm (QGA) and HSR model. Simultaneously,total seven HSR model, such as non-regular multiple linear regression (MLR) and stepwise linear regression (SLR), regularized ridge regression (RR), LASSO regression, and elastic network regression (ENR), non-linear Support Vector Machine regression (SVM) and neural network regression (NNR) are introduced to compare and analyze the effects of different broad band partitioning methods on above seven HSR models.Result Non-linear HSR models and non- regularized linear HSR models have the higher error than the linear regularized HSR models for the thermal infrared emissivity HSR. LASSO and ENR models are not sensitive to broadband partitioning methods, while RR is more sensitive to broadband partitioning methods. The average error of the linear regularized RR model is the smallest, and the maximum error of ENR is the lowest for the seven land cover material. The distribution of thermal infrared broadband affect the error variation of HSR results at the wavelength direction through the changes of central wavelength and band width. For example, the QGA-SLR broadband optimization results can improve the error difference of HSR at the wavelength direction and enhance the overall performance of HSR.Conclusion The performance of a certain HSR model could be improved by the optimal broadband selection method, which changes with the changes of HSR model. Simultaneously, the ideal thermal infrared broadband not only improve the comparability of multi-source thermal infrared remote sensing products but provide technical support for the research and development of thermal infrared remote sensing sensors. The optimized combination of broadband selection methods and HSR models provides methodological support for full band HSR.  
    关键词:Hyperspectral Reconstruction;Thermal Infrared Emissivity;Broadband;Quantum Genetic Algorithm;Machine Learning Method   
    63
    |
    16
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 157056362 false
    更新时间:2026-05-21

    HE Da, LI Zeyu, LIU Haoran, HU Xikun, ZHONG Ping, SHI Qian

    DOI:10.11834/jrs.20265303
    img
    摘要:Objective Fine-grained ship detection in optical remote sensing imagery is an important research topic in maritime monitoring and ocean observation. Compared with coarse-grained ship detection, fine-grained ship detection aims to distinguish specific ship types with highly similar visual appearances, which places higher demands on dataset scale, category diversity, and annotation accuracy. However, existing ship detection datasets are generally limited in the number of fine-grained categories, instance scale, and scene diversity, which restricts the performance and generalization ability of deep learning–based oriented object detection models. The objective of this study is to construct a large-scale fine-grained ship detection dataset and to further expand data diversity through controllable synthetic data generation, thereby providing a reliable data foundation and benchmark for fine-grained ship detection in optical remote sensing imagery.Method First, a large-scale fine-grained ship detection dataset, named LAFI, is constructed using high-resolution optical remote sensing images collected from 36 representative ports worldwide. The dataset contains 8,000 images acquired under diverse imaging conditions and complex maritime environments. A total of 49 fine-grained ship categories are defined, and 48,717 ship instances are manually annotated using oriented bounding boxes to accurately describe ship orientation and geometric characteristics. Second, to alleviate the limitations of real data in terms of scale and scene coverage, a controllable diffusion-based data generation framework is designed to extend LAFI into a million-scale synthetic dataset, referred to as LAFI-Diffusion. By incorporating structured textual prompts that describe scene types, weather conditions, and temporal information, the diffusion model is guided to generate realistic ship images under diverse maritime scenarios. The generated synthetic samples are further filtered and combined with real data to form large-scale training sets suitable for fine-grained ship detection. Finally, several representative oriented object detection methods are selected and evaluated on the constructed datasets to analyze the effectiveness of synthetic data augmentation and to establish benchmark results.Result Experimental results show that the proposed LAFI dataset provides improved category richness, instance scale, and scene diversity compared with existing fine-grained ship detection datasets. Moreover, incorporating synthetic data from LAFI-Diffusion into the training process consistently improves detection performance and generalization ability across different oriented object detection models. Performance gains are particularly evident in complex maritime environments, such as crowded ports and scenes with varying sea states and illumination conditions. The benchmark evaluations also indicate that the contribution of synthetic data varies across detection methods, suggesting that appropriate integration strategies are important for fully exploiting synthetic data.Conclusion This study presents a large-scale fine-grained ship detection dataset that integrates real optical remote sensing imagery with controllable diffusion-based synthetic data generation. By substantially expanding data scale and enhancing scene diversity, the proposed LAFI-Diffusion dataset effectively addresses the limitations of existing fine-grained ship detection benchmarks. The experimental results confirm that synthetic data can serve as an effective complement to real-world samples, improving detection accuracy and robustness for oriented ship detection models. The released datasets and benchmark results provide valuable support for future research on fine-grained ship detection and related remote sensing applications.  
    关键词:remote sensing imagery;ship detection;dataset;diffusion model;synthetic data generation;Fine-grained recognition;oriented bounding box;data augmentation   
    285
    |
    439
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 155030570 false
    更新时间:2026-04-21

    CHEN Hao, WU Yanhong, ZHENG Siqi, CHI Haojing, TENG Xuankai, LI Junsheng

    DOI:10.11834/jrs.20265422
    img
    摘要:Objective Shorelines of rivers and lakes constitute the critical interface between inland water bodies and adjacent terrestrial systems, functioning as key transitional zones that regulate flood storage, buffer hydrological extremes, and maintain aquatic–terrestrial ecological system. Accurate and timely monitoring of shoreline dynamics is therefore substantial for practical flood risk mitigation, water resource management, and the planning, protection, and restoration of freshwater ecosystems. This study aims to develop an effective and reliable deep learning (DL) model to identify shorelines from high-resolution remote sensing imagery, supporting systematic quantification of their spatiotemporal variation in response to climate and human intervention.Method Deep learning models based on the Swin-UNet architecture were set-up to identify water extents, boundaries of which were then vectorized to obtain the shorelines of the study area by using the Gaofen and Sentinel-2 remote sensing imagery respectively. The Swin-UNet was adopted in this study due to its performance in capturing both local details and long-range contextual information, enabling robust delineation of complex shoreline morphologies. Annual and monthly shorelines were retrieved from Gaofen and Sentinel-2 respectively for the period 2015–2025 based on the trained and well validated Swin-UNet models. Intra-annual and interannual variations of shorelines in terms of their length, development index, fractal dimension and composition were then assessed.Result The Swin-UNet models, based on both Gaofen (GF) and Sentinel-2 imagery, achieved a classification accuracy of water pixel exceeding 90%, with clear, continuous shoreline boundaries and well-preserved geometric details, significantly outperforming traditional thresholding and machine learning methods. Results based on Gaofen imagery show that the annual mean area of water extent in the study area was around 314.7 km², with a corresponding shoreline length of about 7,306.4 km. The results derived from Gaofen imagery were largely consistent (R2=0.99) with those from Sentinel-2. However, owing to its higher spatial resolution, the water area extracted from Gaofen imagery was about 18.9% larger than that from Sentinel-2, while the shoreline length was approximately 75% longer. Length of the shoreline showed stronger intra-annual variability (CV=12.8%) than inter-annual variability (CV=9.0%). Shorelines of artificial water bodies were found more morphologically stable than that of natural water bodies. Composition of shorelines in the study area exhibited a transition from semi-natural/semi-artificial to artificial shorelines, while natural shorelines remained relatively stable.Conclusion The study demonstrates that Swin-UNet–based deep learning approaches provide highly accurate, spatially detailed, and temporally consistent monitoring of river–lake shoreline dynamics, which is essential for effective management, protection, and restoration of inland aquatic environments. Changes in the composition of the shorelines in the study area were closely associated with urban spatial expansion and strongly influenced by a series of ecological restoration policies, including the “Grain for Green” program and wetland restoration initiatives.  
    关键词:river and lake shorelines;dynamic monitoring;Swin-UNet;Gaofen satellite;Sentinel-2;Chenzhou City   
    485
    |
    498
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 155030216 false
    更新时间:2026-04-21

    PENG Zimeng, DUAN Yuling, YU Qiangyi, Wu Wenbin, Zhang Shuai, Zhao Chunlei, Li Boliang, Zhang Xin

    DOI:10.11834/jrs.20265396
    img
    摘要:Objective Deep learning methods have demonstrated considerable potential for the fine-scale extraction of plot-level crop distributions from remote sensing imagery. However, their performance is heavily reliant on large volumes of high-quality annotated samples, leading to significant challenges, including high labeling costs, poor timeliness, and limited cross-regional generalization capability. These limitations are particularly pronounced in fragmented agricultural landscapes where plot boundaries are complex. This study aims to address these bottlenecks by proposing a novel framework that achieves accurate plot-level rice mapping without the need for region-specific model training and with minimal dependence on manual annotations. By integrating visual foundation models with agricultural prior knowledge, this research seeks to establish a robust, automated solution for crop mapping in diverse environments.Method We propose a plot segmentation prompt optimization method that couples the Segment Anything Model (SAM) with domain knowledge fusion. The core of our framework is an adaptive, iterative learning closed-loop system comprising "segmentation → screening → knowledge update → re-prompting." First, prior knowledge derived from time-series vegetation indices—specifically the Normalized Difference Vegetation Index (NDVI) and Land Surface Water Index (LSWI)—is applied to mask out non-vegetated areas, substantially reducing the computational load. The SAM is then employed for initial segmentation. Crucially, the framework performs statistical analysis on high-confidence segmented plots to dynamically update the thresholds of prior knowledge (e.g., spectral characteristics during flooding and peak growth stages, and geometric area thresholds). These updated thresholds are automatically translated into refined prompt information, including plot center points and boundary points, which are fed back into the SAM model to guide finer segmentation. Furthermore, we innovatively introduce the Intersection over Union (IoU) metric and area change rate as convergence criteria to automatically terminate the iterations, ensuring both algorithmic stability and computational efficiency.Result The proposed method was systematically validated across three geographically and agronomically diverse rice cultivation regions: Ninghe District (Tianjin, China), Fujin City (Heilongjiang, China), and Niigata City (Japan). The results demonstrated high mapping accuracy and strong generalization capabilities. In Ninghe, the method achieved an Overall Accuracy (OA) of 94.44%, a Kappa coefficient of 0.89, and an F1-score of 94.90%. Similarly, high performance was observed in Fujin (OA: 96.80%, Kappa: 0.91) and Niigata (OA: 94.50%, Kappa: 0.86). Comparative analysis of multi-temporal imagery identified the rice harvest period as the optimal phenological window for extraction due to maximized spectral and textural contrast. Ablation studies revealed that the iterative mechanism significantly improved the Kappa coefficient from 0.79 to 0.89 compared to the baseline. Moreover, in a direct comparison with typical supervised learning models (U-Net and DeepLabV3+) trained on local samples, our method achieved superior extraction accuracy and boundary completeness despite using no training samples whatsoever, proving its robustness against sample scarcity.Conclusion This study presents an effective framework for high-precision, plot-level rice mapping that significantly reduces dependency on annotated samples. By successfully coupling the powerful, generic segmentation capability of SAM with agricultural remote sensing prior knowledge through an adaptive iterative mechanism, the method overcomes the "sample dependence" bottleneck of traditional deep learning. It offers a promising technical pathway for large-scale, cost-effective, and automated crop mapping, providing essential data support for food security assessment and sustainable precision agricultural management.  
    关键词:Remote sensing fine extraction;SAM;prior knowledge fusion;rice mapping;adaptive statistical learning;parcel-level segmentation;iterative optimization;prompt information optimization   
    222
    |
    228
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 155030072 false
    更新时间:2026-04-21

    CUI Qunpeng, ZENG Jiangyuan, SHI Pengfei, ZHANG Chunlin, WANG Panshan, RONG Jiaming, MA Hongliang, BI Haiyun

    DOI:10.11834/jrs.20265313
    img
    摘要:(Objective)Soil moisture is a key regulatory factor in the energy exchange between multiple layers of the Earth system. Active and passive microwave signals provide complementary information due to their different responses to soil moisture, and integrating these two types of signals to generate high-accuracy soil moisture products with excellent spatiotemporal coverage is a current research focus and challenge. Mathematical error measurement approaches have shown great potential in this integration process, as they can derive product error metrics relative to true values under certain assumptions. However, limited research has compared the performance and discrepancies of these error measurement methods when combined with different fusion algorithms in terms of fusion product accuracy. This study aims to address this research gap.(Method)Three mathematical error measurement methods, namely the Extended Triple Collocation (ETC) method, Double Instrumental Variable (IVd) method, and Three-Cornered Hat (TCH) method, were employed to evaluate the passive microwave-based SMAP and active microwave-based ASCAT soil moisture products on a global scale, and their spatial distribution patterns of accuracy were revealed. Subsequently, three main fusion algorithms (the minimum random error variance method, the maximum correlation coefficient method, and the maximum signal-to-noise ratio method) were used to generate soil moisture fusion products. The impact of different mathematical error measurement methods on the performance of fusion algorithms was systematically analyzed.(Result)The results show that: 1) The spatiotemporal coverage of all fusion products is significantly enhanced. Among them, the combination of the TCH method and the minimum random error variance method outperforms the original single active and passive products in all error metrics (including unbiased root mean square error, root mean square error, and bias) except for the correlation coefficient (R); 2) Most fusion products exhibit superior performance compared to the ASCAT product and comparable accuracy to the SMAP product; 3) Except for the maximum signal-to-noise ratio method based on IVd, the fusion methods based on TCH are generally superior to those based on ETC and IVd.(Conclusion)This study clarifies the impact of different mathematical error metrics on the performance of soil moisture fusion products, providing a theoretical basis and methodological support for the fusion of active and passive microwave soil moisture products.  
    关键词:soil moisture;fusion algorithm;assessment method;microwave remote sensing;global scale;the minimum random error variance method;the maximum correlation coefficient method;the maximum signal-to-noise ratio method   
    282
    |
    183
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 153119833 false
    更新时间:2026-03-26

    REN Zihan, XU Jiaqi, WEI Shanshan, WU Wenbin, LI Wenjuan

    DOI:10.11834/jrs.20265479
    img
    摘要:Objective The real-time and precise monitoring of crop growth status at the field scale is a critical component for achieving modern precision agriculture. Multi-source remote sensing technologies, including satellite platforms and unmanned aerial vehicles (UAVs), have emerged as effective non-destructive tools for this purpose. However, these data sources present a significant spatiotemporal trade-off, limiting their independent utility. Satellite remote sensing, while offering broad-area coverage, is often constrained by adverse weather conditions and insufficient spatial resolution to capture in-field variability. Conversely, UAV remote sensing provides exceptionally high spatial resolution but is hampered by limited battery endurance, making large-scale continuous (e.g., daily) monitoring challenging. Consequently, a single data source is inadequate for supporting the continuous monitoring of crop growth at the field scale. To address this critical gap, this study proposes a cross-platform spatiotemporal fusion method. The objective is to synergistically integrate satellite and UAV data, effectively leveraging their complementary temporal and spatial resolutions to generate a continuous, high-resolution dataset for precision agriculture.Method This research was based on a synergistic combination of multi-platform, multispectral remote sensing data, including high-resolution UAV imagery, Sentinel-2 data and PlanetScope SuperDove data. We developed an improved CACAO (Consistent Adjustment of the Climatology to Actual Observations) algorithm, adapting its core logic for cross-platform data fusion rather than its original climatological application. Two distinct data combination strategies were designed and tested: (1) a baseline “UAV+Sentinel-2” strategy and (2) an enhanced “SuperDove+Sentinel-2+UAV” strategy, which integrates high-frequency commercial satellite data. The CACAO framework was implemented using two distinct modes: a “forward prediction” (FP) mode, designed for near real-time applications, and a “backward updating” (BU) mode, which iteratively refines historical estimates as new data becomes available. The final output of the framework is a near real-time, daily 1-meter resolution normalized difference vegetation index (NDVI) time-series dataset. The accuracy of the fusion results was rigorously evaluated using two methods: (1) Leave-One-Out Cross-Validation (LOOCV), which assesses the model’s predictive power, and (2) a benchmark comparison against the established GLM-STF (Generalized Linear Model-based Spatiotemporal Fusion) algorithm.Result The prerequisite for data fusion was confirmed as NDVI data from the different platforms exhibited good consistency, with a strong correlation between Sentinel-2 and SuperDove (R = 0.97) and a reliable correlation was observed between UAV and satellite data (R > 0.75). In addition, the CACAO algorithm was proven to effectively reconstruct the phenological dynamics of the rice crop. A key finding was that the backward updating (BU) mode produced a significantly smoother and more robust NDVI time series than the forward prediction (FP) mode. Both CACAO-based data combination strategies achieved high overall accuracy (R > 0.94). Critically, the study demonstrated that introducing high-temporal-resolution SuperDove data during key phenological stages can substantially improve accuracy, with the correlation increasing from 0.51 to 0.67 in a specific validation case. Finally, in the comparative analysis, the CACAO algorithm demonstrated greater stability and slightly higher accuracy than the GLM-STF algorithm, particularly showing more robust performance across the entire growing season.Conclusion In conclusion, the cross-platform fusion framework proposed in this study, centered on the improved CACAO algorithm, is an effective and robust solution for generating continuous, high-precision (daily 1-meter) field-scale rice NDVI time series. This approach successfully overcomes the limitations of single-source data platforms. The framework provides strong technical support for the fine-grained monitoring of crop growth and the implementation of precision management strategies in modern agriculture.  
    关键词:PlanetScope;Sentinel-2;spatiotemporal data fusion;growth monitor;precision agriculture;field scale;phenological curve;near real-time monitoring   
    437
    |
    707
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 152665691 false
    更新时间:2026-03-17

    CHEN Farong, YANG Guangrui, ZHAO Zhilong, SHI Kun, ZHAO Chu, MENG Lize, YE Zhishan, ZHANG Xinyi, HUANG Changchun

    DOI:10.11834/jrs.20265131
    img
    摘要:Particulate Organic Carbon (POC) is a critical component of the inland water carbon pool and plays a pivotal role in carbon transport, biogeochemical transformation, mineralization, and greenhouse gas emissions. Quantifying both the concentration and the source composition of POC is essential for understanding inland water carbon cycling and improving regional carbon budget assessments. However, the coexistence of endogenous and terrestrial sources creates substantial variability in optical and biochemical properties, limiting the accuracy and generalizability of existing remote sensing algorithms. This study aims to apply an approach that integrates biochemical source–tracing methods with satellite remote sensing to accurately estimate POC concentration and source composition across diverse aquatic environments. A biochemical end–member mixing model was first applied to water samples collected from the Yangtze River mainstem and Lake Taihu to quantify the contribution of endogenous and terrestrial POC. Based on these source apportionments, a three–band remote sensing reflectance ratio was identified as an effective optical indicator for resolving the proportion of endogenous POC. This ratio was used to classify water bodies into two categories characterized by dominant endogenous or terrestrial POC sources. Building upon this classification, a semi–analytical algorithm using inherent optical properties (IOPs) of the water column was applied to estimate POC concentration. Specifically, the particle absorption coefficient of pigments, aph(674), was used as a proxy for endogenous POC concentration, while the non–algal particle absorption coefficient, anap(443), was used to infer terrestrial POC concentration. Total POC concentration was then derived by integrating the two source–specific estimates according to their fractional contributions. The three–band remote sensing reflectance ratio demonstrated strong performance in estimating the proportion of endogenous POC in the water column, yielding an RMSE of 0.081, an MB of 0.0131, and a MAPE of 43.479%. Endogenous POC concentration estimated using aph(674) achieved an RMSE of 1.000 mg/L, an MB of -0.157 mg/L, and a MAPE of 24.455%. Similarly, terrestrial POC concentration predicted from anap(443) showed an RMSE of 0.346 mg/L, an MB of -0.012 mg/L, and a MAPE of 22.200%. When combined, the overall POC algorithm outperformed existing empirical and semi–analytical models, exhibiting an RMSE of 1.322 mg/L, an MB of -0.177 mg/L, and a MAPE of 29.380%. These results confirm that integrating optical classification with source–specific modeling significantly improves the robustness and generalizability of POC retrievals across heterogeneous inland waters. This study demonstrates that coupling remote sensing techniques with biochemical source–tracing provides a powerful framework for quantifying both POC concentration and source composition at broad spatial and temporal scales. The approach effectively leverages the mechanistic insights offered by biochemical methods and the large–scale observational capability of satellite remote sensing. The resulting algorithm enhances the interpretability and transferability of POC retrievals, offering a promising tool for advancing carbon pool monitoring in inland waters. However, the algorithm has not yet been validated beyond the Yangtze River mainstem and Lake Taihu regions. Future work will focus on acquiring additional datasets from diverse hydrological and optical environments to further refine model performance and assess its broader applicability.  
    关键词:inland waters;particulate organic carbon;isotopic tracing;n–alkanes;semi–analytical model   
    190
    |
    158
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 152668155 false
    更新时间:2026-03-13

    GUO Rui, FU Ben, ZHU Xiufang, SONG Junying

    DOI:10.11834/jrs.20265421
    img
    摘要:Objective Crop lodging poses a significant threat to agricultural productivity and food security, yet existing monitoring approaches often suffer from low automation, insufficient integration of pre- and post-disaster data, and a lack of systematic spatiotemporal coordination. To overcome these limitations, we developed an automated framework, StandardCurve-iForest-RF, which aims to establish a resilient crop growth baseline, distinguish true lodging from noise, and enable precise spatiotemporal mapping for disaster management.Method The approach utilizes time-series Sentinel-2 satellite data to construct a Crop Growth Standard Curve (CGSC) for the target crop. The Soft Dynamic Time Warping (Soft-DTW) algorithm is employed to create this curve, which serves as a resilient reference model capable of accommodating inter-annual climatic variations. To detect lodging, the method calculates cumulative multi-feature anomaly scores by comparing post-disaster satellite observations against the pre-established standard curve. An Isolation Forest (iForest) algorithm is applied for initial anomaly detection across multiple spectral features. Subsequently, a spatiotemporal joint decision mechanism is implemented to refine the results, effectively suppressing false alarms caused by persistent cloud cover, cloud shadows, and other environmental noise. Finally, a Random Forest (RF) classifier is used to accurately map the spatial extent and precise boundaries of the lodged areas, completing the fully automated workflow from dynamic detection to precise mapping.Result The method was validated using a case study of a lodging event that occurred on September 15, 2020, in Bohetai Township, Zhaoyuan County, Daqing City, Heilongjiang Province. It successfully identified the lodging event, demonstrating its effectiveness in distinguishing actual crop damage from noise. The overall detection accuracy reached 80.36%, with a Kappa coefficient of 0.60, confirming a substantial agreement between the automated detection results and ground reference data. The results clearly showed that the integrated use of the standard curve, the anomaly scoring mechanism, and the spatiotemporal decision rules significantly enhanced the reliability of lodging identification and minimized false positives. The entire process, from data processing to the final generation of the lodging map, was executed automatically without manual intervention.Conclusion The StandardCurve-iForest-RF framework presents a significant advancement in automated crop disaster monitoring. Its core innovation lies in the construction of a resilient growth standard curve and a sophisticated spatiotemporal analysis pipeline that effectively differentiates true lodging from interference. The successful application in a real-world case study confirms the method's practical utility and accuracy. This framework provides a valuable tool for agricultural departments and emergency management agencies, enabling rapid assessment of crop damage extent and supporting timely disaster response and loss estimation. The methodology is adaptable and holds promise for application in other regions and for monitoring other types of abrupt agricultural disasters.  
    关键词:Crop Lodging Monitoring;Standard Growth Curve;time-series analysis;Spatiotemporal Joint Detection;Sentinel-2   
    169
    |
    644
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 152665661 false
    更新时间:2026-03-13

    SUN Kaiping, ZHANG Jialong, TENG Chenkai, YANG Kun, HUANG Kai, LEI QiWang, XIONG Dengliang

    DOI:10.11834/jrs.20265246
    img
    摘要:Objective Accurately estimating the aboveground carbon stock of forests is crucial for forest management and the sustainable development of forest ecosystems. Since single machine learning models still face issues such as weak generalization, low estimation accuracy, and high uncertainty when estimating forest carbon stocks, this study aims to explore new hybrid models to improve the efficiency of model building and prediction accuracy, thereby enhancing the accuracy of aboveground forest carbon stock estimation.Methods Taking Forest Resources Class II Survey data and Landsat 8 OLI imagery as data sources, a feature selection method integrating Genetic Algorithm (GA) and CatBoost (Genetic Algorithm and CatBoost, GAC) was proposed, combining GA's global optimization capability in feature subset exploration with CatBoost's prominent strengths in mining nonlinear feature relationships and quantitatively evaluating feature importance; on this basis, GAC was systematically compared with the traditional Recursive Feature Elimination (RFE) method to screen remote sensing feature variables that effectively improve model accuracy, while the Hyperopt hyperparameter optimization algorithm was adopted to iteratively search the hyperparameter space of each machine learning regression model to obtain the optimal parameter combinations. Subsequently, a stacked ensemble AST regression algorithm based on base learner mean fusion and meta-learner-based adaptive weighting was constructed, which selects four optimized single machine learning models—Adaptive Boosting (AdaBoost), CatBoost, Random Forest Regression (RFR), and Light Gradient Boosting Machine (LightGBM)—as base learners to fully leverage the unique advantages of each model and significantly enhance the overall performance of the algorithm. Finally, remote sensing estimation models for carbon stock were established based on six single machine learning regression models (AdaBoost, CatBoost, RFR, LightGBM, Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost)) as well as the AST ensemble model, and after comprehensively comparing the estimation accuracies of all candidate models, the optimal one was selected to conduct high-precision inversion mapping of Pinus densata carbon stock in Shangri-La City.Results 1) RFE selected 9 variables, while GAC selected 7 variables, among which the 7 variables selected by GAC contributed more to the accuracy of Pinus densata AGC inversion; 2) Based on Hyperopt, the hyperparameters of each model were iteratively optimized, and it was found that the optimal feature subset selected by GAC, when combined with the AST algorithm for regression fitting, achieved the best estimation accuracy, with a coefficient of determination R² = 0.885, a root mean square error RMSE = 8.321 t/hm², and a prediction accuracy P = 86.4%; 3) Based on the optimal estimation model, the aboveground carbon stock of Pinus densata in Shangri-La City in 2016 was estimated to be 7.70953 million t, with an average carbon density of 40.015 t/hm²; 4) The directionality of texture features has a significant impact on the estimation accuracy of forest carbon storage by the AST model, and the 45° diagonal direction is the optimal direction for carbon storage estimation under this model.Conclusion The AST algorithm exhibits higher stability and anti-interference ability under multiple cross-validations, which effectively improves the estimation accuracy of nonparametric models and thereby reduces model uncertainty. This method can provide an effective reference for the dynamic monitoring of forest resources in other high-altitude areas.  
    关键词:Carbon stocks;Hyperopt hyperparameter tuning;machine learning;AST;Pinus densata;GAC;remote sensing inversion;uncertainty analysis   
    182
    |
    503
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 152665584 false
    更新时间:2026-03-13

    GE Yun, CHEN Jinliang, WEN Ning, CEN Yubo, WANG Anni, WANG Ting

    DOI:10.11834/jrs.20265395
    img
    摘要:In the field of computer vision, lightweight target detection in remote sensing images is an academically challenging research topic. With the continuous improvement of the accuracy of remote sensing image target detection, the complexity of the model has increased significantly accordingly. The current remote sensing image target detection methods find it difficult to achieve an effective balance between accuracy and model complexity. Unlike conventional natural images, remote sensing images have a large imaging area, limited available information, and a multitude of target types. The vast amount of background information can easily interfere with detection, and the excessive number of target categories makes it difficult to achieve high-precision detection of specific targets. Secondly, targets in remote sensing images are densely packed, arbitrarily rotated, and exhibit significant scale variations, posing a great challenge for target localization. Furthermore, as the accuracy continues to improve in remote sensing image target detection, models have become increasingly complex in pursuit of high precision. These complex models’ heavy computational load and large parameter count severely restrict practical application on resource-constrained devices, model lightweighting has become a key factor in promoting technological application. To address the aforementioned issues, this paper proposes a lightweight remote sensing image target detection method based on Spatial-Channel Reconstruction, aiming to maintain high-precision detection of circular and rectangular-like targets while reducing model complexity. Firstly, in response to the redundant information within feature maps at the spatial dimension, a Spatial Reconstruction Unit (SRU) is adopted. It separates feature maps based on the richness of spatial features into groups with abundant spatial information and groups with redundant spatial information, and then performs cross-reconstruction operations on the two groups to reduce spatial feature redundancy and enhance the spatial feature representation of remote sensing image targets. Subsequently, to address the redundancy of channel information in feature maps, a Partial Convolution-based Channel Reconstruction Unit (PCRU) is proposed. It divides feature maps into two parts along the channel dimension, one part uses partial convolution for efficient feature extraction, and the other part uses pointwise convolution to obtain hidden detailed information. The two parts are weighted and reconstructed, and then concatenated to extract features at a lower computational cost and enhance the representation of important channels. The lightweight method based on Spatial-Channel Reconstruction is applied to ACRFNet, an adaptive circular receptive field network for remote sensing image target detection. Experimental results on the NWPU VHR-10, DIOR and HRRSD datasets verify its performance. Specifically, the method can maintain or even improve detection accuracy, while significantly reducing the model’s computational load and parameter volume. Compared with other mainstream methods for remote sensing image target detection, the proposed method demonstrates obvious advantages in balancing detection accuracy and model complexity, especially for circular and rectangular-like targets that are difficult to detect. The lightweight remote sensing target detection method based on Spatial-Channel Reconstruction effectively solves the core problem of balancing detection accuracy and model complexity. By reducing feature redundancy, it overcomes the deployment limitations of complex models on resource-constrained devices, retains good adaptability to typical targets, and provides a reliable technical support for the practical application of remote sensing target detection technology.  
    关键词:remote sensing imagery;object detection;circular target;model lightweighting;reconstruction unit   
    165
    |
    160
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 151982008 false
    更新时间:2026-03-09

    PENG Yilin, FU Yingchun, XING Hanfa, CHEN Shuqi, LI Zhenhao, ZHANG Si

    DOI:10.11834/jrs.20255171
    img
    摘要:Objective Street view imagery (SVI) has emerged as an important geospatial big data source for perceiving the urban built environment. Accurately detecting facade-level changes and identifying their semantic categories is essential for monitoring urban renewal dynamics. However, existing change detection approaches struggle to separate temporal ownership of changed objects (change decomposition) and to directly provide semantic change information, leading to complex workflows and high data preparation costs. This study aims to develop a weakly supervised semantic change detection framework that integrates change decomposition and semantic labeling, and to apply it to dynamic mapping of urban renewal in Guangzhou, China.Method We propose Cross-C2PO, a novel dual-branch architecture designed to achieve end-to-end weakly supervised change decomposition. Unlike traditional single-branch models, Cross-C2PO introduces a cross-comparison mechanism to explicitly model asymmetric temporal differences, ensuring completeness and consistency regardless of input order. The model integrates a differentiable union operator to maintain consistency constraints during weak supervision and employs the proposed Cross-MTF feature fusion function to break commutativity for accurate temporal differentiation. Building on Cross-C2PO outputs, we design a semantic change detection workflow that leverages state-of-the-art segmentation models (e.g., DeepLabV3+) without requiring synthetic datasets. Finally, we introduce an urban renewal dynamic index to quantify facade-level changes and visualize renewal patterns across panoramic and directional views (front, back, left, right) for Guangzhou’s central districts (2013–2019), based on 11,431 pairs of Baidu street view panoramas.Result Traditional change detection methods fail to achieve end-to-end change decomposition and rely on complex multi-stage pipelines, limiting scalability and flexibility. In contrast, the proposed Cross-C2PO framework enables one-stage weakly supervised change decomposition and can seamlessly integrate with mainstream architectures, granting them both improved detection accuracy and the ability to perform temporal ownership splitting without additional labels. Experiments on multiple benchmark datasets demonstrate that our method consistently achieves state-of-the-art performance, outperforming existing approaches in both binary change detection and decomposition tasks. Ablation studies further validate the contribution of the cross-branch structure, Cross-MTF fusion, and the differentiable union operator. Applied to Guangzhou street view imagery, the workflow successfully produced urban renewal dynamic maps, revealing high-intensity updates clustered in Liwan and Baiyun industrial areas, while moderate changes dominate residential zones. Directional view analysis additionally highlights local disparities and micro-scale renewal patterns.Conclusion The proposed Cross-C2PO framework offers a simple yet effective solution for weakly supervised semantic change detection, enabling accurate change decomposition without additional synthetic labels. Combined with an interpretable urban renewal dynamic index, it provides a scalable and cost-effective approach for urban facade change analysis. This study bridges street view imagery and AI-based computer vision for urban analytics, offering new insights into spatiotemporal renewal dynamics. Future work will focus on optimizing computational efficiency and extending the method to multi-source data integration for large-scale applications.  
    关键词:urban renewal;street view imagery;semantic change detection;scene change detection;weak supervision;dynamic index   
    351
    |
    561
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 151841317 false
    更新时间:2026-03-06

    YIN Wenjie, WANG Xuelei, WANG Chen, WANG Hang, HUANG Caisheng, ZHAO Ruixue, MENG Fanle, LIU Jinxiu

    DOI:10.11834/jrs.20265294
    img
    摘要:(Objective)River discharge is a pivotal variable within the hydrological cycle, holding significant importance for flood warning, water resource allocation, and eco-environmental management. Traditional ground-based methods are limited by sparse station distribution and high costs of data acquisition, particularly in areas with complex terrains or remote regions, making it difficult to meet the demands of precise water resource management. Satellite remote sensing technology offers extensive coverage and high spatiotemporal resolution, providing new data sources and methodologies for river discharge monitoring. Machine learning (ML) approaches can accurately simulate complex relationships between river discharge and multiple driving factors, offering novel avenues for processing intricate hydrological data and optimizing models. By integrating machine learning algorithms with remote sensing and in-situ river discharge, it can provide the innovative measure for the efficient and reliable of river discharge monitoring.(Method)This study selected the Tangnaihai Hydrometry Station as the study area, and proposed a river discharge monitoring method by integrating satellite remote sensing and ML methods. Firslty, the Sentinel-2 imagery was utilized to extract river water surface width based on the Google Earth Engine cloud platform. The GLDAS v2.2 model-simulated five variables were served as predictor variables, namely evapotranspiration, soil moisture, temperature, terrestrial water storage and runoff. Discharge monitoring models were subsequently developed based on four statistical methods (linear function, power function, exponential function, and polynomial function) and four ML algorithms (XGBoost, Random Forest, LightGBM, and CatBoost). The discrepancies among different models were assessed, and the Shapley Additive Explanation (SHAP) method was employed to quantify the importance of different input variables.(Result)The results demonstrate that the polynomial function model demonstrates superior performance over other three statistical models during the testing period, with an R²of 0.67, and its error metrics (MAE: 319.01 m³/s, RMSE: 393.14 m³/s) were lower than those of the other three statistical models. Compare to traditional statistical approaches, ML models exhibit significant improvements overall in both simulation accuracy and stability, and the coefficient of determination (R2) increased by 46.15%, while the root mean square error (RMSE) and mean absolute error (MAE) decreased by 54.61% and 55.65, respectively. Notably, the Random Forest model achieved the optimal performance in the testing phase, with the R2 of 0.96, Nash-Sutcliffe efficiency coefficient (NSE) of 0.89, RMSE of 172.81 m3/s, and MAE of 147.33 m3/s, reflecting robust generalization capability and stability. SHAP analysis revealed that water surface width contributed most significantly to the discharge monitoring model (189.02), followed by soil moisture (145.11) and temperature (97.41). The runoff variable exhibited the minimal degree of influence on the river discharge monitoring model with the value of 14.14%.(Conclusion)This study confirms the feasibility and superiority of integrating satellite remote sensing and ML approaches for high-accuracy discharge estimation in regions characterized by complex topography and data scarcity. Future work could be optimized by integration of higher-resolution satellite imagery and mechanistic models with physical processes.  
    关键词:satellite remote sensing;machine learning;statistical models;SHAP method;discharge monitoring;Tangnaihai Hydrometry Station   
    285
    |
    697
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 151055693 false
    更新时间:2026-02-27

    Wang Xingbin, Zhou Guangyao, Zhang Peng, Ye Jinzhou, Zhang Hongsheng, Geng Xiurui, Ji Luyan

    DOI:10.11834/jrs.20255092
    img
    摘要:Objective This study aims to overcome the limitations of existing public water body datasets, such as single temporal resolution and low annotation accuracy. The objective is to construct a high-quality, multi-temporal lake extraction dataset based on the high-resolution wide-field multispectral imagery from the "GF-1" satellite, which offers improved temporal and spatial coverage of water bodies.Method To achieve this, three study areas with varying levels of dynamic change were selected: Poyang Lake (high dynamic change), Namtso Lake (moderate dynamic change), and Yangcheng Lake (low dynamic change). These areas were covered in four seasons of 2022. The GF-1 wide-field multispectral imagery underwent preprocessing, including radiometric correction, orthorectification, and quick atmospheric correction. For the annotation process, a hybrid strategy combining automated methods with manual visual interpretation was employed to ensure high annotation accuracy.Result The resulting dataset is characterized by multi-temporal data and high annotation accuracy, offering a significant improvement over existing datasets. The overall accuracy of the dataset for all three study areas and across all four seasons exceeded 94%.It provides reliable data for dynamic water body mapping and change monitoring across seasonal variations. Additionally, various water body extraction methods, including threshold segmentation, traditional machine learning algorithms, and deep learning techniques, were employed to validate the dataset’s practical utility. The results demonstrated that the dataset supports the effective training and evaluation of these methods.Conclusion The findings indicate that the constructed multi-temporal lake extraction dataset is highly reliable and can effectively support various water body extraction methods. It provides a robust data foundation for enhancing the performance of dynamic water body extraction algorithms, and contributes valuable data for research in dynamic water body monitoring and mapping using high-resolution remote sensing imagery.  
    关键词:Gaofen-1;Dynamic Water Body;water body extraction;dataset;feature extraction   
    419
    |
    600
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 140755005 false
    更新时间:2025-12-11

    ZHAO Jiawen, ZHOU Chan, XU Caixia, ZHANG Yuxiang, SUN Liqun

    DOI:10.11834/jrs.20254483
    img
    摘要:Objective Quantitative analysis of the coupling coordination relationship between the ecological environment and socioeconomic development in China's poverty-alleviated counties, along with the identification of influencing factors, is of great significance for summarizing poverty alleviation experiences and consolidating the achievements of poverty eradication efforts.Method Based on long-term and high-resolution remote sensing datasets, this study constructed a comprehensive evaluation index system for both the ecological environment and socioeconomic development across 832 poverty-alleviated counties in China. It quantitatively assessed the development levels and coupling coordination status of these two dimensions in 2010, 2015, and 2020. Furthermore, a Boosted Regression Tree model was employed to identify the contribution rates of various indicators to the regional coupling coordination development.Result The results indicated that, overall, the ecological environment in poverty-alleviated counties exhibited a higher level of comprehensive development compared with socioeconomic factors. However, the socioeconomic development progressed at a faster pace than ecological improvements. In addition, the average annual growth rates of both dimensions from 2015 to 2020 were higher than those from 2010 to 2015. Spatially, the coupling coordination degree was the highest in northeastern counties and the lowest in the northwest, showing a distribution pattern of “high in the east and low in the west” and a trend of progressive improvement from coastal to inland areas. Most counties were categorized as “economically lagging”. Among all indicators, population size, gross domestic product, and nighttime light intensity made particularly significant contributions to the coupling coordination.Conclusion Drawing from the poverty alleviation paths and practical experiences of typical regions, the study concludes that implementing context-specific industrial poverty alleviation strategies is crucial for accelerating socioeconomic development in poverty-alleviated counties. Establishing diversified, locally distinctive industries is identified as a key approach for consolidating poverty alleviation achievements and preventing a return to poverty.  
    关键词:poverty-alleviated counties;ecological environment;socioeconomic development;Remote sensing dataset;coupling coordination;boosted regression tree model;contribution rate;spatial distribution pattern;industrial poverty alleviation   
    297
    |
    263
    |
    0
    <HTML>
    <L-PDF><Enhanced-PDF><Meta-XML>
    <引用本文> <批量引用> 133190068 false
    更新时间:2025-11-04
0