About¶
I’m a systems and reliability engineer with a background spanning three decades of infrastructure evolution, with the last 15+ years operating at what would now be considered a Senior/Staff level.
Across that time, my work has stayed centered on reliability, recovery, and understanding how systems behave under stress, from earlier infrastructure environments through modern platform and distributed systems.
How I Work¶
I tend to approach engineering from a systems perspective rather than a purely task-oriented one.
That usually means:
- looking for failure modes early and trying to design around them
- reducing operational noise instead of accepting it as normal
- treating observability and debugging as core parts of the system
- preferring deterministic, explainable behavior over implicit or overly magical systems
What That Looks Like in Practice¶
In practice, that has included:
- operating and maintaining production environments at significant scale
- designing private-cloud infrastructure and internal platforms
- building tooling that improves reproducibility and reduces ambiguity
- driving down alert fatigue and operational overhead through better system design
Working Philosophy¶
Over time, I’ve come to see reliability less as something you bolt on later and more as something that emerges from the way a system is shaped from the beginning.
A lot of the work I’m drawn to is really about shifting effort away from repeated reaction and toward designs that make those failures less likely in the first place.
Outside Work (and Why It Matters)¶

Outside of work, I’ve spent several years working with and socializing a Nile monitor lizard.
They have a reputation for being difficult to handle and unpredictable if not approached correctly. Getting from that baseline to a calm, interactive animal required a lot of structured iteration, observation, and patience.
The process ended up looking a lot like systems work:
- understanding behavior rather than forcing outcomes
- iterating based on feedback instead of assumptions
- building trust incrementally over time
- respecting that complex systems don’t respond well to shortcuts
It’s an unusual example, but it reinforced a lot of the same principles I rely on in engineering: patience, consistency, and letting the system tell you how it behaves instead of assuming you already know.