cv
Basics
| Name | David Pascual-Hernández |
| Label | PhD Student and Research Associate @ URJC | AI | Computer Vision | Autonomous Driving |
| d.pascualhe@urjc.es | |
| Summary | Computer vision researcher dedicated to advancing perception for autonomous driving in off-road environments. Throughout my career, I've been hands-on with the entire vision system lifecycle, from image capture to deployment, including leading an R&D team at SEDDI to bridge the gap between AI and physical applications for textile materials digitization. Currently pursuing a PhD at Rey Juan Carlos University, my research focuses in RGB, LiDAR, and fusion-based semantic segmentation for scene understanding in unstructured environments. Outside of the lab, you’ll usually find me exploring nature! |
Work
-
2024.10 - Present PhD Student and Research Associate
Universidad Rey Juan Carlos
Perception for autonomous driving, focusing on RGB, LiDAR, and fusion-based semantic segmentation for scene understanding in off-road environments.
-
2023.01 - 2024.10 Optical Team Lead
SEDDI
Responsible for overseeing the daily operations of a small team of R&D engineers, focusing on planning, reviewing, and delivering complex computer vision projects. These projects leveraged advanced, deep learning-based solutions. Alongside my leadership duties, I maintained an active role in the development process, ensuring hands-on involvement from concept to implementation.
-
2018.08 - 2023.01 Computer Vision Engineer
SEDDI
As a computer vision engineer at SEDDI, I contributed to the development of its flagship product, textura.ai, from its inception. Joining the company at a very early stage allowed me to be involved in every step of the process. I contributed to designing a complex multi-camera and multi-illumination digital material acquisition setup, developed tools to extract textile properties using both classic computer vision and deep learning methods, and implemented solutions for color constancy and calibration across various imaging devices.
-
2017.10 - 2018.08 Immersive Video Researcher
Nokia Bell Labs
As an immersive video researcher, my internship primarily focused on the field of Human-Computer Interaction, specifically on developing user interfaces based on hand gesture recognition. Additionally, I became familiar with augmented, mixed, and virtual reality concepts and techniques.
-
2016.08 - 2017.01 Computer Vision Engineer
Bosch
As a machine vision engineer intern, I provided support to the Process Engineering Department by engaging in a variety of tasks. These included studying Data Matrix reading systems and optical lenses, as well as developing an optical calculator. Additionally, I diagnosed and designed computer vision systems and developed computer vision applications with Neurocheck.
Volunteer
-
2023.11 - Present -
2020.06 - Present -
2017.09 - Present
Education
-
2024.10 - Present -
2017.12 - 2020.12 -
2016.12 - 2017.06 -
2012.12 - 2017.12
Certificates
| Linguaskill General - Score 180+ (C1 or above) | ||
| Cambridge International Education |
| Full Stack Deep Learning - Spring 2021 | ||
| Full Stack Deep Learning |
Publications
-
2026.05.07 Cross-dataset Evaluation of Visual Semantic Segmentation Models for Off-Road Autonomous Driving
Expert Systems with Applications
Intelligent autonomous driving in off-road environments is an emerging field with great potential to impact areas such as agriculture, forestry, and rescue operations. Perception in these scenarios presents unique challenges due to the diversity of elements and weather conditions, along with the inherent ambiguity in class definitions. Consequently, off-road visual semantic segmentation datasets remain underdeveloped, roughly ten times smaller than their urban counterparts, hindering dependable performance assessment and potentially compromising the safety of autonomous systems. To address these challenges, we present a comprehensive cross-dataset evaluation of visual semantic segmentation models for autonomous off-road navigation. We propose a unified ontology that harmonizes class definitions across relevant datasets, enabling their combination for both training and testing. This approach ensures fair model comparisons and reliable assessment of generalization to unseen domains. We further benchmark models on the original datasets, analyze the impact of different ontology harmonization criteria and conversion strategies, and evaluate the trade-off between segmentation performance and computational cost. Results show that Transformer-based architectures achieve the most consistent segmentation performance across datasets. While often computationally demanding, some variants maintain real-time inference (≈12 ms) with top-tier accuracy. The unified ontology simplifies the segmentation task, yielding more reliable models and about 40% faster training convergence. Cross-dataset training further enhances generalization, improving mean IoU by up to +20% on RUGD and +13% on WildScenes compared to RELLIS-3D-only training. Overall, this study provides valuable insights for developing robust perception modules for off-road autonomous vehicles.
-
2022.05.12 Efficient 3D human pose estimation from RGBD sensors
Displays
Human pose estimation is a core component in applications for which some level of human–computer interaction is required, such as assistive robotics, ambient assisted living or the motion capture systems used in biomechanics or video games production. In this paper, we propose an end-to-end pipeline for estimating 3D human poses that works in real-time in an off-the-shelf computer, using as input video sequences captured with a commercial RGBD sensor. Our hybrid approach is composed of two stages: 2D pose estimation using deep neural networks and 3D registration, for which a lightweight algorithm based on classic computer vision techniques has been developed. We compare several 2D pose estimators and validate the performance of our proposed method against the state-of-the-art, using as benchmark an international and publicly available dataset. Our 2D to 3D registration module alone can reach frame rates of up to 99 fps, while achieving an average error per joint of 132 mm. Furthermore, the proposed solution is agnostic to the model used for 2D pose estimation and can be upgraded with new upcoming solutions or adapted for different articulated objects.
Skills
| Computer Vision |
| Deep Learning |
| Artificial Intelligence |
| Autonomous Driving |
| Python |
| Linux |
| Docker |
| PyTorch |
| Robotics |
| LaTeX |
| Machine Learning |
| Open-source |
| Git |
| OpenCV |
Languages
| English | |
| Full Professional |
| Spanish | |
| Native Speaker |