
Hello! I am a MASc candidate in Mechanical and Industrial Engineering in the ASBLab under the supervision of Professor Goldie Nejat at the University of Toronto. I completed my undergraduate degree at the University of Waterloo, obtaining a BASc in Mechatronics Engineering. My research focuses on the use of computer vision to enable robots to intelligently navigate complex environments. This has led to research in both end-to-end navigation approaches and decomposed approaches relying on object detection methods.
ASBLab
Research Assistant
-
NVIDIA
Machine Learning Software Engineering
-
NVIDIA
Graphics Software Engineer
-
Agfa Graphics
Computer Engineer
-
Wind Energy Group at the University of Waterloo
Wind Energy Research Assistant
-
flōt Autonomous Blimp Robot
Apprenticeship Learning via Inverse RL
Control and Localization of a Jumping Robot
Deep Generative Models
Reinforcement Learning with Deep Q Nets
Meta Learning (Learning to Learn)
University of Toronto
Master of Applied Science Candidate, Mechanical and Industrial Engineering
-
Research related to artificial intelligence, computer vision, optical character recognition, weakly supervised learning, meta learning, and robotics.
University of Waterloo
Bachelor of Applied Science, Mechatronics Engineering
-
Focus in robotics, artificial intelligence, and computer vision.
Apprenticeship Learning via Inverse RL
For my Markov Decision Processes class, two of my lab-mates and I presented the apprenticeship learning via inverse reinforcement learning algorithm. To aid the explanation of the method, we implemented it with two types of student learners: a tabular method and a double deep Q-network. To provide the required teacher signal, we trained experts using a known reward signal; the features gathered by the experts were then used within the apprenticeship learning algorithm. A more detailed explanation of the methodology can be found in our presentation on the topic.
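As a rough illustration of the outer loop, here is a minimal sketch of the projection variant of apprenticeship learning via inverse RL; `train_policy` and `estimate_mu` are hypothetical callbacks standing in for the inner RL students (tabular or double DQN) and Monte-Carlo estimates of feature expectations, not the exact code we used.

```python
import numpy as np

def apprenticeship_learning(mu_expert, train_policy, estimate_mu, n_iters=20, eps=1e-3):
    """Projection variant of apprenticeship learning via inverse RL (Abbeel & Ng, 2004).

    mu_expert    : expert feature expectations, shape (k,)
    train_policy : callable(w) -> policy trained (e.g. tabular Q or double DQN)
                   to maximize the linear reward R(s) = w . phi(s)
    estimate_mu  : callable(policy) -> Monte-Carlo estimate of that policy's
                   discounted feature expectations, shape (k,)
    """
    # Start from an arbitrary policy and its feature expectations.
    policy = train_policy(np.zeros_like(mu_expert))
    mu = estimate_mu(policy)
    mu_bar = mu.copy()
    w = mu_expert - mu_bar
    for _ in range(n_iters):
        w = mu_expert - mu_bar           # reward weights point toward the expert
        if np.linalg.norm(w) < eps:      # student's features match the expert's
            break
        policy = train_policy(w)         # inner RL step (tabular / double DQN student)
        mu = estimate_mu(policy)
        # Orthogonal projection of mu_expert onto the line through mu_bar and mu.
        d = mu - mu_bar
        mu_bar = mu_bar + (d @ (mu_expert - mu_bar)) / (d @ d) * d
    return policy, w
```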
Deep Generative Models
I think generative models will have wide applicability in the future of neural networks, not only for robots but for many AI tasks in general. I was particularly interested in their ability to explicitly model intra-class factors of variation, creating regions of latent space that generate images within the same class while varying along intuitive factors. These properties seem complementary to those trained for in metric learning, namely ensuring that similar inputs map closely within the network's latent space. I therefore implemented several deep generative algorithms, including GANs and VAEs, following CS236.
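As a flavour of what these implementations look like, here is a minimal VAE sketch in PyTorch (fully connected, sized for 28x28 images); the layer sizes and latent dimension are illustrative rather than the exact configuration I used.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    """Minimal fully connected VAE for 28x28 images (e.g. MNIST)."""
    def __init__(self, latent_dim=20):
        super().__init__()
        self.enc = nn.Sequential(nn.Flatten(), nn.Linear(784, 400), nn.ReLU())
        self.mu = nn.Linear(400, latent_dim)
        self.logvar = nn.Linear(400, latent_dim)
        self.dec = nn.Sequential(nn.Linear(latent_dim, 400), nn.ReLU(),
                                 nn.Linear(400, 784))

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        # Reparameterization trick: sample z while keeping gradients w.r.t. mu and logvar.
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)
        return self.dec(z), mu, logvar

def elbo_loss(x, recon_logits, mu, logvar):
    # Reconstruction term (Bernoulli likelihood) plus KL divergence to the unit Gaussian prior.
    recon = F.binary_cross_entropy_with_logits(recon_logits, x.view(-1, 784), reduction='sum')
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl
```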
Jumping Robot
We designed a robot to complete an obstacle course that required it to travel across the course and reach a goal. One of the major challenges within the course was overcoming an approximately 1 meter tall obstacle at its center. Given that this obstacle was known ahead of time, it was clear that the optimal way to cross to the other side was to go over it. This led to the creation of our jumping robot. We designed all mechanical, electrical, software, and control systems from the ground up to work together to achieve this goal.
flōt Autonomous Blimp Robot
Flōt was my fourth-year design project in undergrad (check out the project website for more information), where we explored the possibility of creating an autonomous blimp robot that could navigate within a home and provide smart-home assistance. The robot's limited payload posed significant technical challenges, severely restricting the selection of actuators, sensors, and on-board computation. We settled on a payload consisting of a Raspberry Pi Zero W, a propeller module, a sonar sensor for height tracking, and a camera. We explored various methods of learning to navigate within a complex home environment with this limited sensor payload, including traditional methods such as mapping and detecting obstacles, as well as potential end-to-end learned methods like imitation learning and deep RL.
Our implementation had to overcome several major constraints, including running a "real-time" controller and limiting, as much as possible, the amount of labeling and human involvement needed for its operation. These restrictions, and in particular the desire for the robot to operate out of the box without an explicit map, directed us toward end-to-end approaches as part of our navigation stack.
Our first goal was to set up the training and evaluation code in simulation to enable rapid development. Many of the ideas used during this stage originally came from 'Learning to Fly by Crashing'. The training data was collected by randomly sampling trajectories until crashing, and a heuristic-based labelling system was then applied to generate labels: at a high level, we took the initial part of each trajectory to be of the positive class and the later part, just before crashing, to be of the negative class. We trained a CNN to reproduce these classifications. Overall, it's reasonable to think of the network as trying to output the probability of crashing if the robot were to follow a particular trajectory. After the initial rounds, we went back, collected more data from the failure cases, and added it to our training set. This process is quite similar to the DAgger algorithm.
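A rough sketch of the heuristic labelling step is shown below; the split fraction and crash window are hypothetical values chosen for illustration, not the exact thresholds we used.

```python
def label_trajectory(frames, safe_frac=0.5, crash_window=10):
    """Heuristically label a randomly sampled trajectory that ends in a crash.

    frames       : camera frames ordered in time, with the last frame at the crash
    safe_frac    : assumed fraction of the trajectory treated as safe (positive class)
    crash_window : assumed number of frames before the crash treated as negative class
    """
    n = len(frames)
    positives = [(f, 1) for f in frames[: int(n * safe_frac)]]
    negatives = [(f, 0) for f in frames[max(0, n - crash_window):]]
    # The CNN is then trained on these (frame, label) pairs to predict crash probability.
    return positives + negatives
```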
Unfortunately, since our platform was slow moving, this approach was not directly transferable to real life. We overcame this limitation by replicating the data collection technique in the real world: we manually moved the robot throughout a room and used distance measurements to determine possible collision scenarios for training. We manually filtered the data to remove label noise caused by the limitations of the sonar sensor.
In the end, we integrated Amazon's Alexa into the robot, allowing users to ask it to perform basic functions such as taking a selfie.
Deep Q Networks
I reimplemented the deep Q-learning paper to play different Atari games. I also incorporated various extensions, such as multiple parallel agents and double Q-learning.
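The core of the double Q-learning extension is the target computation, sketched below with hypothetical network handles: the online network selects the greedy action while the target network evaluates it, which reduces over-estimation bias.

```python
import torch

def double_dqn_targets(online_net, target_net, rewards, next_states, dones, gamma=0.99):
    """Compute double Q-learning targets for a batch of transitions."""
    with torch.no_grad():
        # Online network picks the greedy next action...
        best_actions = online_net(next_states).argmax(dim=1, keepdim=True)
        # ...and the target network provides its value estimate.
        next_q = target_net(next_states).gather(1, best_actions).squeeze(1)
        return rewards + gamma * (1.0 - dones.float()) * next_q
```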
Meta Learning - Ongoing
Within this ongoing project I am exploring recent work in the area of meta-learning by reimplementing recent papers following CS330. Thus far I have gathered some results on the use of memory-augmented neural networks for classification. I plan to incorporate the concepts I have learned into my future research.
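As an illustration of the setup, here is a sketch of how a few-shot classification episode can be constructed for a memory-augmented network; the data layout and parameters are assumptions made for the example, not my exact pipeline.

```python
import numpy as np

def make_episode(images_by_class, n_way=5, samples_per_class=10, rng=np.random):
    """Build one episode for a memory-augmented neural network (Santoro et al. style).

    images_by_class : list where entry c is an array of images for class c (assumed layout)
    The label presented alongside each image is the *previous* image's label, so the
    network must bind images to labels in its external memory to classify later repeats.
    """
    classes = rng.choice(len(images_by_class), size=n_way, replace=False)
    xs, ys = [], []
    for episode_label, c in enumerate(classes):     # class labels are re-shuffled every episode
        idx = rng.choice(len(images_by_class[c]), size=samples_per_class, replace=False)
        for i in idx:
            xs.append(images_by_class[c][i])
            ys.append(episode_label)
    order = rng.permutation(len(xs))                # interleave the classes in time
    xs = np.stack([xs[i] for i in order])
    ys = np.array([ys[i] for i in order])
    # Offset the labels by one timestep: y_in[t] = ys[t-1]; a dummy label starts the sequence.
    y_in = np.concatenate(([n_way], ys[:-1]))
    return xs, y_in, ys
```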