P I X E V I A : A I B A S E D , R E A L - T I M E C O M P U T E R V I S I O N S Y S T E M F O R D R O N E S Mindaugas Eglinskas, CEO at PIXEVIA www.pixevia.com
P I X E V I A : A I B A S E D , R E A L - T I M E C O M P U T E R - - PowerPoint PPT Presentation
P I X E V I A : A I B A S E D , R E A L - T I M E C O M P U T E R - - PowerPoint PPT Presentation
P I X E V I A : A I B A S E D , R E A L - T I M E C O M P U T E R V I S I O N S Y S T E M F O R D R O N E S Mindaugas Eglinskas, CEO at PIXEVIA www.pixevia.com Origins in R&D projects for Lithuanian MoD. Autonomous systems research
Origins in R&D projects for Lithuanian MoD. Autonomous systems research at Vilnius University (2004 – 2017)
4
5
P I X E V I A
CORE
Real-time imaging
CORE X1
Hardware interfaces
OBJECTS
AI based object recognition
NAV
Navigation
FUSION
Information / sensor fusion
Integrates AI technology in daily commercial drone operations. Uses NVIDIA Jetson TX1 and PIXEVIA for: First responder
- Person detection & tracking
- Vehicle detection & tracking
- License plate recognition
- Privacy masking
Inspection
- Object detection / Classification
- Recognizing condition deviations
- Determine fault location
- Distance measurement
Real-time onboard mapping
Drones with AI for security applications. Uses NVIDIA Jetson TX1 and PIXEVIA system: carrierboard, pipelines and
- bject classification.
Fully autonomous control Fully autonomous information processing Fusion of information from different machines and data sources
Main driving forces powered by machine learning
U S E C A S E S F O R A I P O W E R E D D R O N E S
Surveillance: defence / law enforcement / private security
cars (with number plate recognition) trucks boats people heavy machinery
Automated infrastructure inspections
Automated inspection: power lines utility poles insulators foreign objects
Inventory management
Containers Wagons Locomotives Cars Materials Packages
Smart city
Parkings Car flows Persons Security
S O F T W A R E A R C H I T E C T U R E / D E C I S I O N S / L E S S O N S L E A R N E D
CORE
Intelligent real-time imaging: collection, transformation, communication
Interfaces with sensors and file formats (input) On-board image processing. Image processing pipelines Image processing modules Geographical metadata Distributed processing Industry standards
OBJECTS
…
Object detection, properties of objects (size, speed, coordinates)
Cars / Trucks License plate recognition People Face detection Other objects: boats, environment, etc.
NAVIGATION
Visual position estimation
Visual odometry Image-map matching Foreign object detection on the landing site
INFORMATION FUSION
…
Fusion of information in real-time
Information fusion from different sources (drones, cameras) Vizualization on 3D map
USB3 / USB2 SD-card UART HDMI GPIO CAN miniPCIe CSI I2C SPI PWM Accel, gyro, compass, barometer
- 1. Interfaces
Long range digital datalink (Microhard) Industrial block camera GPRS datalink Thermal imaging camera Gimbal control Ultra fast camera for visual
- dometry
Wifi , local communcations
- 2. Sensors
Spaghetti type of integration will kill any bigger project OpenVX / Gstreamer NVIDIA provides accelerated Gstreamer modules for encoding DDS for real time communication Processing can be changed before the mission or during the flight
- 3. Modular architecture
- 4. Simple description of image processing pipeline and tools
- 5. Geographical metadata
Every video frame contains geographical information:
- image corners with coordinates, position of
drone/camera, angles of sensors, camera geometry. Allows integration with GIS systems, provides data for later learning.
HARDWARE NVIDIA Frameworks Other frameworks
- 6. Hardware and software frameworks used
- 7. GPU based moving object detection
VisionWorks (OpenVX graph) CUDA OpenCV with CUDA optimization Caffe cuDNN
- 8. Region detection with convolutional neural networks
cuDNN Caffe Fully convolutional network Single shot detection Filtering after detection
- 9. Self-adaptation mode - “sleep”
Real-time Sleep
Self-adaptation during the sleep
Information from the fast neural models Slow models (big neural nets,
- ther computer vision
algorithms, physics)
- 10. Simulation and learning
31
MobilEye
600 people doing labeling
MobilEye photo
AI (neural networks / SGD) limitations
- It can win all games
(if can play more than human plays through all life)
- Recognize images
(if can see more than human sees through all life) Terrible performance with small datasets
Simulated data from unity3d
Simulation for data fusion
- 11. 3D reconstructions during the flight
210 000 points, 90 seconds on Jetson TX1, 4 images
- 12. Visual position estimation
Visual odometry Image - map matching Terrain segmenation
Deep neural networks 500 000 training item dataset. Convolutional neural network, cuDNN
Multiple hyphothesis tracking
- 13. User interfaces in the embedded system
Web / Qt via HDMI and datalinks
PIXEVIA version 0.5 Technology preview
Current status
AI for autonomous systems
CORE
Real-time imaging
CORE X1
Hardware interfaces
OBJECTS
AI based object recognition
NAV
Navigation
FUSION
Information / sensor fusion
P I X E V I A : A I B A S E D , R E A L - T I M E C O M P U T E R V I S I O N S Y S T E M F O R D R O N E S Mindaugas Eglinskas, CEO at PIXEVIA www.pixevia.com