Waleed Abdulla

Engineering Leader · GenAI & 3D Perception

Waleed Abdulla

Engineering Leader · GenAI & 3D Perception

I lead an applied research organization at Apple, building and growing multiple teams that develop GenAI and Computer Vision algorithms behind key features of Vision Pro and other Apple products.

My interests include generative models, LLMs, object tracking, hand tracking, computer graphics, and synthetic data.

Previously, I worked on Computer Vision and Machine Learning projects across Silicon Valley startups, and created two open-source projects that earned 25K+ GitHub stars and around a thousand academic citations.

Earlier, I founded and led a venture-backed startup that I scaled to over 7 million users, and served on the board of Hacker Dojo, a non-profit technology community center in Mountain View.

LinkedIn, GitHub, Medium, Email

At a glance

Role: Engineering Leader, Apple
At Apple: Since 2018
Based in: Mountain View, CA
Focus: Generative 3D Models, Object Tracking, LLMs, Synthetic Data
Open source: 25K+ GitHub stars

My work at Apple (2018–present) is confidential, so I can't detail specific contributions. My organization develops algorithms in the areas covered by these public materials:

Explore enhancements to visionOS object tracking (2026)

WWDC26 session: new capabilities in object tracking and spatial accessory input on visionOS, including tracking moving and handheld objects and building custom spatial accessories.

Iterate your spatial scenes faster with Reality Composer Pro 3 (2026)

WWDC26 session: building immersive scenes in Reality Composer Pro 3, with content, visual effects, lighting, interactivity, and AI-assisted iteration in the editor.

visionOS 26 introduces powerful new spatial experiences for Apple Vision Pro (2025)

Apple Newsroom: overview of the new spatial experiences and capabilities introduced in visionOS 26.

Explore spatial accessory input on visionOS (2025)

WWDC25 session: developer guide to spatial accessory input on visionOS.

Explore object tracking for visionOS (2024)

WWDC24 session: developer guide to object tracking on visionOS.

visionOS 2 brings new spatial computing experiences to Apple Vision Pro (2024)

Apple Newsroom: overview of visionOS 2, including spatial photos and new hand gestures.

Meet ARKit for spatial computing (2023)

WWDC23 session: introduction to ARKit on visionOS, covering world tracking, scene understanding, and hand tracking.

Mask R-CNN for Object Detection and Segmentation

Star Fork

An implementation of Mask RCNN on Keras and TensorFlow. I built this during my work at Matterport and they graciously agreed to open source it. Since its release in November 2017, it has become one of the top instance segmentation models on TensorFlow and was used by thousands of developers in applications ranging from Kaggle competitions to Ph.D theses. It has earned around a thousand academic citations, plus 25K+ stars and 11K+ forks on GitHub.

https://github.com/matterport/Mask_RCNN

Mask R-CNN instance segmentation on a 4K street scene

HiddenLayer: Neural Network Visualization and Monitoring

Star Fork

Like TensorBoard, but simpler and runs directly in Jupyter notebook. It was adopted by Microsoft as one of the building blocks of their TensorWatch tool. Supports PyTorch, TensorFlow, and Keras.

https://github.com/waleedka/hiddenlayer

HiddenLayer neural network graph rendered in a Jupyter notebook

Splash of Color: Instance Segmentation with Mask R-CNN and TensorFlow (2018)

A question I kept getting about my Mask RCNN implementation was: how do I use it with my own dataset? So I built a toy example and documented the whole process, including manually annotating a small dataset from scratch. The result was an article and code to automatically apply a color splash effect on images of balloons (i.e. convert the image to grayscale except for balloons, which stay colored).

Link to article

Color splash effect: grayscale image with balloons kept in color

Traffic Sign Recognition with TensorFlow (2017)

I occasionally talk about deep learning at local events. This is a talk I did for ACM (Association for Computing Machinery) about using TensorFlow to recognize traffic signs.

Recording on YouTube

Waleed Abdulla

Waleed Abdulla

Selected Apple Work

Explore enhancements to visionOS object tracking (2026)

Iterate your spatial scenes faster with Reality Composer Pro 3 (2026)

visionOS 26 introduces powerful new spatial experiences for Apple Vision Pro (2025)

Explore spatial accessory input on visionOS (2025)

Explore object tracking for visionOS (2024)

visionOS 2 brings new spatial computing experiences to Apple Vision Pro (2024)

Meet ARKit for spatial computing (2023)

Open-Source Projects

Mask R-CNN for Object Detection and Segmentation

HiddenLayer: Neural Network Visualization and Monitoring

Earlier Talks and Articles

Splash of Color: Instance Segmentation with Mask R-CNN and TensorFlow (2018)

Traffic Sign Recognition with TensorFlow (2017)