About¶

I’m a systems and reliability engineer with a background spanning three decades of infrastructure evolution, with the last 15+ years operating at what would now be considered a Senior/Staff level.

Across that time, my work has stayed centered on reliability, recovery, and understanding how systems behave under stress, from earlier infrastructure environments through modern platform and distributed systems.

How I Work¶

I tend to approach engineering from a systems perspective rather than a purely task-oriented one.

That usually means:

looking for failure modes early and trying to design around them
reducing operational noise instead of accepting it as normal
treating observability and debugging as core parts of the system
preferring deterministic, explainable behavior over implicit or overly magical systems

What That Looks Like in Practice¶

In practice, that has included:

operating and maintaining production environments at significant scale
designing private-cloud infrastructure and internal platforms
building tooling that improves reproducibility and reduces ambiguity
driving down alert fatigue and operational overhead through better system design

Working Philosophy¶

Over time, I’ve come to see reliability less as something you bolt on later and more as something that emerges from the way a system is shaped from the beginning.

A lot of the work I’m drawn to is really about shifting effort away from repeated reaction and toward designs that make those failures less likely in the first place.

Outside Work (and Why It Matters)¶

Nile monitor lizard

Outside of work, I’ve spent several years working with and socializing a Nile monitor lizard.

They have a reputation for being difficult to handle and unpredictable if not approached correctly. Getting from that baseline to a calm, interactive animal required a lot of structured iteration, observation, and patience.

The process ended up looking a lot like systems work:

understanding behavior rather than forcing outcomes
iterating based on feedback instead of assumptions
building trust incrementally over time
respecting that complex systems don’t respond well to shortcuts

It’s an unusual example, but it reinforced a lot of the same principles I rely on in engineering: patience, consistency, and letting the system tell you how it behaves instead of assuming you already know.