← udacity portfolio
ml cv other · deep learning · December 2019

TensorFlow PoseNet (port)

Port of PoseNet from Caffe to TensorFlow, also during the Robot Perception course. Same model, friendlier toolchain.

What it was

Companion to the Caffe PoseNet fork — same model (end-to-end 6-DOF camera pose regression), same dataset, same paper. This one ported the architecture + weights to TensorFlow so the rest of the lab’s CV codebase (also TF-based) could load it without managing two frameworks.

What I’d do differently with hindsight

The whole “port between deep learning frameworks” exercise is largely obsolete:

But in 2019 this was a real workflow problem, and the port did teach me the GoogLeNet/Inception module by going through it layer by layer in two implementations.

What it taught me

Reading someone else’s deep-learning code is the fastest way to learn an architecture. PoseNet’s GoogLeNet backbone has those infamous “inception modules” — three branches with different kernel sizes concatenated. Porting forces you to look at every layer’s shape and stride. After that, papers reference Inception and the picture is clear instead of vague.


Source archive: Shivam-Bhardwaj/tensorflow-posenet (archived)
Writeup last touched: 2026-05-22