arxiv_cs_cv 2026年4月24日

SparseGF: コンテキスト圧縮付き高所感知型スパース分割フレームワークによる、都市景観から自然景観までの頑健な地上分類

SparseGF: A Height-Aware Sparse Segmentation Framework with Context Compression for Robust Ground Filtering Across Urban to Natural Scenes

Open original article

Translated: 2026/4/24 19:43:16

deep-learningground-filteringpoint-cloudlsar3d-mapping

Japanese Translation

arXiv:2604.21356v1 発表タイプ：新作要約：航空レーザースキャニング（ALS）データから導き出された高品質なデジタル地形モデルは、多種多様な地理空間解析において不可欠であり、その生成には点群を異なる景観において地上と地上以外の部分へ分離するための頑健な地上分類（GF）が通常使用されています。現在の深層学習に基づく GF メソッドは、特に特定の困難な地形において印象的な性能を示していますが、それらの横展開的汎化能力は、計算リソースの制約による大規模処理におけるコンテキスト詳細のジレンマと、分類のみで最適化された際に高所物体から生じるランダムな誤分類という、継続的な 2 つの問題によって制限されています。これらの限界を克服するために、コンテキスト圧縮を強化した高所感知型スパース分割フレームワークの「SparseGF」を提案しました。この手法は、以下の 3 つの主要な革新的要素に基づいています：(1) 膨大なコンテキストをコンパクトな表現へと濃縮し、かつ中心部の詳細を保持する凸鏡をインスパイアしたコンテキスト圧縮モジュール；(2) コンテキスト圧縮を誘発する幾何学的歪みを軽減しつつ効率的に圧縮された表現を解釈するハイブリッド型スパースボクセル・ポイントネットワークアーキテクチャ；および (3) 訓練中にランダムな高所物体の誤分類を抑制するために明示的に地形標高の先验を強制する高所感知型損失関数。2 つの大規模 ALS ベンチマークデータセットにおける広範な評価により、SparseGF が都市から自然の地形まででの頑健な GF を達成することが示されました。具体的には、複雑な都市シーンにおける卓越した性能、混合地形における競争力のある結果、そして密度の高い森林斜面地域における中程度だが大災害性のものではない精度が得られました。本研究は、深層学習に基づく GF 研究に新たな洞察を提供し、大規模環境監視のための真に横展開的汎化に向けたさらなる探求を促します。

Original Content

arXiv:2604.21356v1 Announce Type: new Abstract: High-quality digital terrain models derived from airborne laser scanning (ALS) data are essential for a wide range of geospatial analyses, and their generation typically relies on robust ground filtering (GF) to separate point clouds across diverse landscapes into ground and non-ground parts. Although current deep-learning-based GF methods have demonstrated impressive performance, especially in specific challenging terrains, their cross-scene generalization remains limited by two persistent issues: the context-detail dilemma in large-scale processing due to limited computational resources, and the random misclassification of tall objects arising from classification-only optimization. To overcome these limitations, we propose SparseGF, a height-aware sparse segmentation framework enhanced with context compression. It is built upon three key innovations: (1) a convex-mirror-inspired context compression module that condenses expansive contexts into compact representations while preserving central details; (2) a hybrid sparse voxel-point network architecture that effectively interprets compressed representations while mitigating compression-induced geometric distortion; and (3) a height-aware loss function that explicitly enforces topographic elevation priors during training to suppress random misclassification of tall objects. Extensive evaluations on two large-scale ALS benchmark datasets demonstrate that SparseGF delivers robust GF across urban to natural terrains, achieving leading performance in complex urban scenes, competitive results on mixed terrains, and moderate yet non-catastrophic accuracy in densely forested steep areas. This work offers new insights into deep-learning-based GF research and encourages further exploration toward truly cross-scene generalization for large-scale environmental monitoring.