For autonomous vehicles to operate safely in real-world environments, they must go beyond object
detection and understand semantic relations and situational context within a scene. In particular,
dilemma situations involving conflicting constraints — such as accident avoidance, pedestrian priority,
and traffic rule compliance — are difficult to resolve using conventional 3D Scene Graphs (3DSGs),
which mainly represent spatial structure.
To address this limitation, this paper proposes a Knowledge Graph (KG)-enhanced semantic scene
understanding framework tailored to autonomous driving dilemma scenarios. The proposed KG represents
not only objects, attributes, and relations, but also traffic rules, class hierarchies, commonsense
context, and scenario-specific semantic descriptions in a structured form.
We evaluate the framework using 30 scene-understanding queries across six cognitive categories under
a controlled LLM-as-judge setting (GPT-4o-mini for answering, GPT-4o for judging; N=300). Results
show that the KG-based method significantly outperforms the 3DSG baseline in reasoning quality
(mean 4.41 vs. 3.56, Wilcoxon p<0.001, Cohen's d=0.84), with the largest gains in
dilemma reasoning (+1.71) while spatial queries confirm design fairness (−0.07). A five-condition
ablation study reveals that natural-language semantic descriptions (rdfs:comment) are the dominant
contributor to performance.