CV | Isaac Bernat

Experience

Sabbatical Barcelona, Spain
Aug 2024 - current

An intentional period of professional development focused on deepening expertise in AI/LLM-driven engineering, system design and modern development workflows. Key activities include:

System Design: Authored a comprehensive, production-level technical design for a heuristic-based Spam Classification Engine, demonstrating a pragmatic approach to building new features within legacy systems.
AI/LLM Application & Tooling: Developed several open-source projects, including a data pipeline using Python and LLMs for a generative art archive and maintaining the popular netflix-to-srt tool (800+ stars).
Continued Learning: Completed specialized courses on AI/LLM applications from DeepLearning.AI and Coursera to stay current with emerging technologies.

A curated selection of GitHub Projects from this period is detailed below.

Preply Barcelona, Spain
Mar 2020 - Jul 2024

Staff Backend Engineer

Jun 2022 – Jul 2024

I shaped the technical roadmap and led initiatives that directly influenced the company's bottom line and engineering culture.

Business Impact: Drove the "Postpone Billing" experiment, the single most profitable initiative of 2023 (out of 215 A/B tests), delivering a +3% global Gross Margin lift. Championed the project for three quarters against initial resistance, turning short-term negative metrics into a massive long-term win. (Read the Case Study)
Strategic Foresight: Identified critical risks in a major company initiative ("Subs Direct") and formally advocated for a phased, iterative approach to mitigate over 10k+ hours in potential wasted development. The core of my proposed strategy was eventually adopted after the initial "big bang" approach proved too complex.
Cross-Functional Leadership: As the engineering representative for the Subscription Experience team I held bi-weekly stakeholder meetings with Finance, Data and CRM teams to coordinate on critical projects, ensuring data models and financial reporting remained accurate during high-stakes experiments. I also supported engineering-wide quality by conducting technical interviews for multiple backend roles.
Proactive System Ownership: Identified and led the mitigation of critical bugs outside my team's domain, including a flaw costing over $21k USD/week and a celery task spamming the queue with over 1 million executions per hour.

Senior Backend Engineer

Mar 2020 – Jun 2022

As the founding backend architect for the Subscription MVP, I led the technical transition from a package-only model to a recurring revenue system that became the company's primary growth engine.

Founding the Subscription Model: As the sole backend architect on the MVP, took on a high-risk challenge and led the transition from a package-only model to subscriptions. This foundational work grew to process over 200k+ monthly renewals and was instrumental in securing a Series C funding round. (Read the Full Case Study)
Incident Management & Observability: Commanded and documented 15+ production incidents, including a SEV-2 that went undetected for 16 hours and led to a new, mandatory pre-launch monitoring policy for all high-risk features company-wide. (Read the Incident Retrospective)
Team Scaling & Mentorship: After the MVP's success, helped build the dedicated Subscriptions team by interviewing 13 candidates and hiring 2 foundational engineers. Improved team agility by facilitating sprint retrospectives, cutting meeting time by 50% while increasing engagement.
Modernization & Performance: Led the backend migration of the high-traffic (1M+ MAU) Q&A application from Django templates to a GraphQL API. This initiative drastically improved performance (e.g. page load time reduced by ~85% and Googlebot crawl rate increased by 8x) and unblocked a series of A/B tests that resulted in a +300% lift in New Paying Customers from the blog and an +800% CVR increase on the Q&A user flow.

Ivbar Institute Stockholm, Sweden
Mar 2018 - Mar 2020

Fullstack Engineer

Jan 2019 – Mar 2020

Stepped into a fullstack role to meet team needs, contributing to both a Python backend and a React-based frontend for visualizing complex healthcare data.

Technical Expertise & Communication: Selected to speak at PyCon Sweden 2019, presenting a deep-dive on code optimization that demonstrated a >10<sup>18</sup>x performance improvement on a practical problem.
Data Visualization: Engineered the UI to display complex medical case-mix decision trees, empowering healthcare professionals to better understand and analyze treatment pathways.

Backend Engineer

Mar 2018 – Jan 2019

Developed data processing systems focused on data integrity and patient privacy.

Healthcare Analytics: Architected a core data processing engine for the Swedish National Quality Registry for Breast Cancer (NKBC), enabling the analysis of treatment outcomes across different hospitals to identify best practices.
Data Privacy & Anonymity: Implemented MECE-compliant privacy filters to ensure patient k-anonymity in sensitive medical datasets, a critical, non-negotiable requirement for project viability and ethical data handling.

Productos Aditivos Montcada, Spain
Jun 2016 - Jan 2018

Assumed a dual CTO/CFO leadership role in a medium-sized enterprise, driving both technology modernization and financial optimization initiatives.

Financial Strategy & Negotiation: Secured significant cost savings by renegotiating multi-year commercial loans, reducing interest rates by 35% on average and saving the company over 64k EUR. Also identified and rectified multi-thousand-euro payroll errors.
Technology & Vendor Management: Oversaw a critical ERP migration and managed the company-wide transition to the new Spanish SII real time tax reporting system, ensuring business continuity and compliance.
Data-Driven Business Intelligence: Initiated and led a market intelligence project using customs data to analyze competitor and supplier trends, providing key strategic insights to leadership that were previously unavailable.

Wrapp Stockholm, Sweden
Oct 2012 - Jun 2016

Progressed from an early, versatile engineer to a key Tech Lead during the company's critical "Wrapp 2.0" pivot, transforming the architecture from a legacy monolith to modern microservices. I was also selected to speak at the inaugural PyCon Sweden.

Tech Lead & Founding Engineer (2.0 Pivot)

Sep 2014 – Jun 2016

As a Tech Lead for "Wrapp 2.0", I helped guide the technical transformation from a legacy Python 2.7/Twisted monolith to a modern Go/Python 3 microservice architecture.

DevOps & Tooling: Architected and built "Opsweb," a mission-critical internal deployment system using Dart for managing canary releases of our 50+ microservices to AWS, complete with health checks and automated rollbacks.
Product Engineering: Contributed to the "Wrappmin" merchant dashboard (React), focusing on the backend APIs for mapping credit card transactions to offers.
Microservice Ownership: Created and maintained critical microservices in both Python (users) and Go (offers), coordinating with multiple contributors to ensure stability and performance.

Early Engineer (Backend, Data, Frontend)

Oct 2012 – Sep 2014

As an early engineer in a fast-growing startup, I demonstrated high versatility by taking on roles across the stack to meet evolving business needs.

Engineered Core Systems from Scratch: Built the company's first content recommendation engine and a comprehensive A/B testing email framework (delivering over 1M newsletters weekly).
Owned Redshift ETL Pipeline: Architected and optimized the company's core data warehouse pipeline for business intelligence and built real-time KPI dashboards.

FXStreet Barcelona, Spain
Feb 2012 - Jul 2012

Contributed to a high-traffic, multi-language financial news portal on a Microsoft Azure and C#/.NET stack, serving a global user base of forex traders.

Internationalization (i18n): Handled complex front-end challenges related to localization, ensuring UI/UX consistency across 17 languages, including right-to-left scripts (Arabic) and character sets with different spacing requirements (Russian, Japanese).
Platform Development: Maintained and developed features on the core C#/.NET platform, working within a Mercurial (Hg) version control system.

Case Studies

Model MVP: Building a Company-Defining Success (2021)

Led the backend architecture for a company-defining project that transitioned a manual, package-based business to a recurring subscription model. The solution resulted in a +41% YoY increase in LTV, an improvement on the LTV-to-CAC ratio from 1.4x to 3.0x and was instrumental in securing a Series C funding round.

Read the full case study ↓

The Challenge: Unpredictable Revenue and High User Friction

The company's business model was based on selling prepaid packages of hours (6, 12, or 20). This created two major problems:

For the Business: It led to unpredictable, lumpy revenue streams, making financial forecasting difficult.
For the User: The manual renewal process created significant friction. Students would often run out of hours unexpectedly, disrupting their learning cadence.

Our proposal, "Subscribe to a Tutor", aimed to solve this by introducing auto-renewing subscriptions, based on the hypothesis that predictable lessons would align user goals with a more stable, recurring revenue foundation for the business.

Defining Success: An MVP-First, Data-Driven Approach

Because this was a fundamental change to the financial model, our strategy was not to build a fully-featured system, but to launch a Minimum Viable Product (MVP) within a controlled A/B test.

We defined success criteria for two distinct phases:

Short-Term (MVP Launch): The immediate goal was successful validation. This required achieving statistical significance in user adoption, ensuring technical stability and generating the clean data needed to inform our decision to scale.
Long-Term (Post-Scale): Success would be measured by a sustainable increase in key business metrics, including User Retention, Customer Lifetime Value (LTV) and Gross Merchandise Value (GMV).

The Solution: Pragmatic Scoping and Product Decisions

My approach was centered on one goal: speed-to-learning. This involved several critical product and delivery decisions that I championed.

1. Strategic Billing Cycle

I advocated for a 4-week billing cycle over a standard monthly one. This was a crucial, non-obvious decision with multiple benefits:

User Alignment: It perfectly matched the weekly lesson cadence of our students.
Predictability: It created fixed-size packages every single time.
Business Velocity: It provided the business with customer insights ~8% faster and resulted in 13 billing cycles per year instead of 12, compounding revenue.

We anticipated a small increase in customer support queries from users accustomed to monthly billing, but we judged the significant benefits to be a worthwhile trade-off, which proved to be correct.

2. Aggressive MVP Feature Scoping

We deferred all non-essential features. For example, we launched without dedicated "upgrade/downgrade" functionality, knowing that users could achieve the same outcome through the existing workflow of canceling and re-subscribing. This kept the initial build lean and focused.

3. Strategic Plan Sizing

We limited the MVP to three distinct plans (1, 2, or 4 weekly hours). This was a deliberate choice to ensure direct comparability with the three tiers of the existing package model, eliminating "decision fatigue" or plan size as a confounding variable in our A/B test results.

4. A Carefully Restricted Audience

To accelerate delivery and minimize the "blast radius" of any potential issues, we launched to a very specific audience segment:

New users only (without referrals): To avoid bias from prior experience with the package model.
English-UI users only: To simplify the initial copy, design work and avoid localization overhead.
Web-only users: To bypass the slower, more complex release cycles of our mobile applications.
Single Payment Provider (Braintree): To avoid the significant effort of building recurring payment support for other (marginal) providers like Stripe or PayPal at the MVP stage.

Delivery & Iteration: Parallel Cohorts

Our most significant time-saving decision was to launch with a partial backend implementation. We went live with the user-facing sign-up flow while the auto-renewal logic and emails were still being built, allowing us to gather behavioral data weeks earlier.

This rapid approach enabled a sophisticated testing strategy. Instead of a single A/B test, we launched several experimental cohorts in parallel, each with an incrementally richer feature set (e.g. adding plan upgrades, more plan sizes). This had never been done before at the company and it provided the rich, long-term data needed to confidently scale the full subscription model.

Deep Dive: The Critical Architectural Decision

The most critical technical decision was how to safely integrate this high-risk system into our large, existing Python/Django monolith. I evaluated three options: deep integration (fast but risky), an isolated microservice (safe but slow) and the chosen hybrid path: a logically isolated module.

This approach offered the best balance of speed and safety. To de-risk it, I implemented several safeguards:

Centralized "Kill Switch": All subscription entry points were guarded by a single, dynamically controlled feature flag. This allowed us to instantly disable the entire feature for all users without a new deployment.
Data Isolation: We created new, dedicated tables for subscription data instead of adding columns to critical core tables, ensuring clean separation and preventing performance degradation for non-subscribers.
Code Isolation: All new code was developed within a distinct subscriptions module/directory, making it clear to other teams that this code was experimental and should not be depended upon.

The modular design proved its worth, allowing for both rapid validation and sustainable scalability. After the successful MVP, we were able to easily add major new features like plan upgrades/downgrades, pauses, hour top-ups and our "breakage" policy for unused hours.

The Results: A Company-Defining Success

The project became the most impactful experiment in the company's history. Within four months of scaling the model to all new users, we achieved:

A +41% year-over-year increase in LTV directly attributable to the project.
An LTV-to-CAC ratio that improved from 1.4x to 3.0x.
The successful close of a Series C investment round, larger than all previous rounds combined.

Within a year, the subscription model accounted for 64.4% of the company's total lesson revenue, fundamentally transforming its financial foundation.

Key Personal Learnings

Observability is a Feature, Not an Afterthought: We initially deprioritized building robust monitoring due to deadline pressure. This decision directly led to a SEV-2 incident that went undetected for 16 hours. I now consider comprehensive monitoring and alerting to be a non-negotiable part of the initial delivery of any critical system.
Lead with a Proposal, Not just Options: As the Directly Responsible Individual (DRI), I learned my role wasn't just to analyze trade-offs, but to lead with an assertive, well-reasoned recommendation. Embracing this mindset accelerated key architectural decisions.
The Power of Pragmatism (YAGNI): This project succeeded because we kept the MVP minimal. A later, failed attempt to launch "yearly subscriptions" was over-engineered for future flexibility that we never needed. It was a powerful lesson in the "You Ain't Gonna Need It" principle. I now challenge complexity much more forcefully, always advocating for the simplest solution that can validate the core hypothesis. (Read the full retrospective on that project).

Subscription Optimization: A Crisis Turned into Best Experiment (2023)

As the epic lead, I drove a company-wide initiative to overhaul our subscription pause and cancellation policies. This project, which initially showed negative results, was turned into the most impactful experiment out of all 215 A/B tests launched company-wide in 2023, delivering a +3% global Gross Margin lift by making a critical, data-informed decision to trust the long-term data over short-term noise.

Read the full case study ↓

The Challenge: A "Leaky Bucket" in Our Subscription Model

Our subscription model had a "pause" feature, but it was a blunt instrument. When users paused, they were completely locked out of taking lessons and any lessons they had scheduled were automatically canceled. This created significant user friction.

Data revealed a critical problem: users were canceling their subscriptions at twice the rate they were using the pause feature. They were opting for the more drastic "cancel" option simply to gain flexibility, creating a "leaky bucket" that drove high churn. Our hypothesis was that by offering a more user-friendly alternative to pausing and by aligning our cancellation policy with industry standards, we could improve both user retention and key financial metrics.

Uncovering a Hidden Liability

Digging deeper, I found that our existing system had a significant flaw: when users canceled, their prepaid hours remained on their balance indefinitely. This was not a deliberate product decision but an implementation artifact from the initial MVP. It created a growing, multi-million dollar liability on our books, as this money technically still belonged to the users. My proposal to expire hours on cancellation was not just about aligning with industry standards, it also was about resolving this hidden financial risk.

My Role: Epic Lead and Backend Architect

As the epic lead for this company-wide initiative, I was responsible for more than just the code. My role involved:

Coordinating a cross-functional effort of 15+ people from various teams, including Backend, Frontend, Product, CRM and Financial Data, each with their own different goals and priorities.
Championing the core product strategy, which I had advocated for across three consecutive quarters.
Architecting and personally implementing 100% of the backend logic.
Driving an accelerated delivery timeline by proactively de-risking the project.

The Solution: A Two-Part Strategy

We proposed a bold, two-part solution to be tested in a single A/B experiment:

A More Flexible "Postpone" Feature: We replaced the rigid "pause" with a new "Change Renewal Date" feature. This allowed users to postpone their next billing date by up to 20 days, once per cycle, without losing access to their lessons or having their scheduled classes canceled.
A Policy to Expire Hours on Cancellation: To create a clearer distinction and align with standard subscription practices, we implemented a policy where any unused hours would expire at the end of the billing cycle upon cancellation.

To accelerate the launch, I identified the critical path, which was blocked not just by technical tasks but by dependencies on our CRM, Financial Data and Frontend teams. I personally championed the project with these teams, negotiating to get our dependencies prioritized even when they fell outside their quarterly goals. This cross-functional leadership, combined with developing key backend components in parallel with ongoing user research, was instrumental in launching the full experiment four weeks ahead of our official schedule (2 sprints).

Ensuring a Scientifically Valid Test

A crucial part of the experiment design, which I insisted upon, was to ensure the integrity of our results. A significant portion of the "unexpired hours" liability came from users who had canceled months or even years prior. Including the one-time financial gain from expiring these historical hours in the A/B test would have massively inflated our short-term metrics and given us a misleading, irreproducible result. Therefore, I designed the backend logic to only apply the new expiration policy to users who canceled after the experiment start date, ensuring we were measuring the sustainable, long-term impact of the change.

Deep Dive: A High-Stakes Decision Based on Incomplete Data

Three weeks into the A/B test, the results looked disastrous. Our primary metrics were negative. The data showed a significant drop in immediate Gross Margin (GMV) and there was immense pressure from stakeholders to kill the experiment.

This was the project's critical moment.

I dug into the data and formulated a strong counter-hypothesis: the metrics were negative because the feature was working. Users were postponing payments, which naturally created a short-term dip in cash flow. The crucial missing piece of data was the long-term impact on retention. Based on early engagement signals showing "postpone" users were far more active than "pause" users, I made the high-stakes call not to kill the experiment, but to "freeze" it. This effectively stopped new users from entering, but allowed us to continue tracking the existing cohorts over a longer time horizon.

The Results: The Most Impactful Experiment of the Year

My hypothesis was proven correct. The long-term data showed a dramatic turnaround. The project was scaled and became the single most impactful initiative of 2023, out of 215 A/B tests (161 superiority and 54 non-inferiority experiments) launched company-wide.

Financial Impact: It delivered a +34% Gross Margin increase within the experiment group, contributing to a +3% global GM lift for the entire company.
Retention Impact: It drove a +4.4% increase in the subscription renewal rate and a +15% increase in plan upgrades.
User Trust: To mitigate the risk of the new policy feeling unfair, I worked with the CRM team to implement clear, proactive email warnings and built a 24-hour "grace period" into the backend logic to give Customer Support a window to easily reverse accidental cancellations. As a result, we saw no significant spike in user complaints related to the new policy.

This project became a new model for the company on how to analyze complex, long-term experiments and reinforced the value of making data-informed decisions, even when the initial signals are noisy and negative.

A Failed Experiment: Key Lessons from Yearly Subscriptions

A retrospective on a failed experiment to launch yearly subscriptions. This project became a powerful lesson in the importance of upfront product validation and the engineering principle of YAGNI ("You Ain't Gonna Need It"), leading to a more pragmatic and data-driven approach in subsequent work.

Read the full retrospective ↓

The Challenge: The "Obvious" Next Step

Following the immense success of our initial 4-week subscription model, the next logical step seemed to be launching a yearly subscription option. The hypothesis had several layers: we could increase long-term user commitment, create a more predictable annual revenue stream and further reduce churn by offering a compelling discount for a year-long commitment.

While we had qualitative data from user research teams suggesting this was a desired feature, the team, myself included, moved into the implementation phase with a high degree of optimism, without fully scrutinizing the underlying business and logistical assumptions.

The Technical Approach: A Mistake in Foresight

Anticipating that the business would eventually want other billing periods (e.g. quarterly for academic terms or summer programs), I made a critical technical error: I over-engineered the solution.

Instead of building a simple extension to support period=yearly, I designed a highly flexible system capable of handling any arbitrary billing duration. This added significant complexity to the codebase, testing and deployment process. My intention was to be proactive and save future development time, but it was a classic case of premature optimization.

The Outcome: A Failed Experiment and Valuable Lessons

We launched the A/B test, but the results were clear and disappointing: the adoption rate for the yearly plan was negligible. The discount we could realistically offer, after accounting for tutor payouts and our own margins, was simply not compelling enough for users to make a year-long financial commitment.

The feature was quickly shelved. The complex, flexible backend system I had built became dead code. This failure was a powerful learning experience that fundamentally improved my approach as an engineer and technical lead.

Learning 1: Validate the Business Case Before Building

The project's failure could have been predicted and avoided with a more rigorous pre-mortem and discovery phase. We jumped into building without asking the hard questions first.

Tutor Viability: The entire premise rested on offering a discount. We never validated if enough tutors were willing to absorb a significant portion of that discount. The handful who agreed to the pilot did so only after heavy negotiation, with the company subsidizing most of the cost, a model that was completely unscalable.
Logistical Complexity: We hadn't solved the critical operational questions. What happens if a student's tutor leaves the platform six months into a yearly plan? The processes for refunds, tutor reassignment and the accounting implications were undefined, creating massive downstream risk for our Customer Support and Finance teams.
Relying on Overly Optimistic Data: I learned to be more critical of qualitative user research that isn't backed by a solid business case. I took the initial presentations at face value, without questioning the difficult financial and operational realities.

This taught me to insist on a clear, data-backed validation of the entire value chain, not just user desire, before committing engineering resources.

Learning 2: The True Meaning of YAGNI

My attempt to build a "future-proof" system was a direct violation of the "You Ain't Gonna Need It" principle. The extra effort and complexity I added not only went unused but also made the initial build slower and riskier. This experience gave me a deep, practical appreciation for building the absolute simplest thing that can test a hypothesis. It's not about being lazy; it's about being efficient and focusing all engineering effort on delivering immediate, measurable value.

This project, more than any success, shaped my pragmatic engineering philosophy and my focus on rigorous, upfront validation.

CI/CD Optimization: Driving $31k in Annual Savings with a 1-Day Fix

Identified a key inefficiency in our CI/CD pipeline and led a data-driven initiative to fix it. This simple change, implemented in one day, resulted in an 80% reduction in unnecessary test runs, saving over $31,000 USD annually in infrastructure costs and hundreds of hours in developer wait time.

Read the full retrospective ↓

The Challenge: An Inefficient and Expensive CI Pipeline

In a May 2022 engineering all-hands meeting, a presentation on our infrastructure costs revealed a surprising fact: 18% of our entire AWS EC2 spend was dedicated to CI/CD. This sparked an idea. Our process ran a comprehensive, 15-minute unit test suite on every single commit, including those in Draft Pull Requests.

This created two clear problems:

Financial Waste: We were spending thousands of dollars every month running tests on code that developers knew was not yet ready for review.
Developer Friction: The Jenkins queue was frequently congested with these unnecessary test runs, increasing wait times for developers who actually needed to merge critical changes.

My Role: From Idea to Impact

As the originator of the idea, my role was to validate the problem, build consensus for a solution and coordinate its rapid implementation.

The Solution: A Data-Driven, Consensus-First Approach

My hypothesis was that developers rarely need the full CI suite on draft PRs, as they typically run a faster, local subset of tests. A simple change to make the CI run on-demand would have a huge impact with minimal disruption.

My approach was fast and transparent:

The Proposal: I framed the solution in a simple poll in our main developer Slack channel. The message was clear: "POLL: wdyt about only running tests on demand for Draft PRs? ... This could help reduce [our AWS costs]. ... We could type /test instead." I also credited the engineer who had already prototyped an implementation, building on existing team momentum.
Building Consensus: The response was immediate and overwhelmingly positive. Within a day, the poll stood at 20 in favor and only 2 against. With this clear mandate, we moved forward.
Rapid, Collaborative Implementation: I coordinated with the engineer from the Infrastructure team who had built the prototype. We ensured the new workflow was non-disruptive: developers could still get a full test run anytime by typing the /test command. We had the change fully implemented and ready for review the same day.

The Results: Immediate and Measurable Savings

The impact of this simple change, validated by data after three months of operation, was significant:

Drastic Reduction in Waste: We saw an 80% reduction in test runs on draft PRs (3,990 PRs without a command vs. 1,060 with one).
Verified Financial Savings: With a calculated cost of $1.95 per test run, this translated to immediate savings of over $2,600 USD per month, or an annualized saving of over $31,000 USD.
Improved Developer Productivity: The Jenkins queue became significantly less congested, saving hundreds of collective engineering hours per month that were previously lost to waiting. This directly translated to faster feedback loops and a more agile development cycle.

This project was a powerful demonstration of how a single, data-backed idea, when socialized effectively, can be implemented rapidly to deliver a massive, measurable return on investment by removing friction and eliminating waste.

Incident Command: Turning a Personal Mistake into Systemic Improvements

A retrospective on a SEV-2 incident where I took full ownership of a pre-launch misconfiguration for a critical MVP. This case study details the methodical response process and the key systemic improvements that resulted, turning a personal mistake into a valuable lesson for the entire engineering organization on the non-negotiable importance of observability.

Read the full retrospective ↓

The Challenge: A Pre-Launch SEV-2 Incident (Feb 2021)

Days before the planned launch of the company-defining Subscriptions MVP, we needed to conduct final Quality Assurance on our CRM email flows. This testing had to be done in the production environment to validate the integration with our email provider.

Due to a critical misunderstanding of our internal A/B testing framework's UI, I incorrectly configured the experiment. I believed I was targeting a small whitelist of test users, but I had inadvertently set the experiment live for 100% of eligible users.

The immediate impact was that thousands of customers were exposed to an incomplete, unlaunched feature. The partial experience consisted of incorrect copy promising auto-renewal and a different set of purchase plan sizes.

My Role: Incident Owner and Scribe

As the owner of the feature and the person who made the mistake, I took immediate and full responsibility. My role during the incident was twofold:

As the Incident Owner, I was responsible for coordinating the response, assessing the impact and driving the technical resolution.
As the designated Scribe, I was responsible for maintaining a clear, timestamped log of all actions and communications, ensuring we would have a precise record for the post-mortem.

The Response: A Methodical Approach Under Pressure

The incident went undetected for 16 hours overnight simply because we had no specific monitoring in place for this new flow. Once it was flagged the next morning, my response followed a clear hierarchy:

Immediate Mitigation: The first action was to stop the user impact. I immediately disabled the experiment in our admin tool, which instantly reverted the experience to normal for all users and stopped the "bleeding."
Diagnosing the Blast Radius: With the immediate crisis averted, I began the diagnosis myself. I queried our database's SubscriptionExperiment table and quickly identified that ~250 users had been incorrectly enrolled, far more than the handful of test accounts we expected.
Resolution and Cleanup: I wrote and deployed a data migration script to correct the state for all affected accounts. This ensured that no user would be incorrectly billed or enrolled in a subscription and that our A/B test data for the upcoming launch would be clean.

The incident was fully resolved in under an hour from the time it was formally declared.

The Outcome: Systemic Improvement from a Personal Mistake

While we successfully corrected the immediate issue, the true value of this incident came from the blameless post-mortem process that I led. The 16-hour detection delay became the central exhibit for a crucial change in our engineering culture.

The post-mortem produced several critical, long-lasting improvements:

Improved Tooling: We filed and prioritized tickets to add clearer copy, UX warnings and a "confirmation" step to our internal experimentation framework to prevent this specific type of misconfiguration from ever happening again.
A New Engineering Rule: We established a new, mandatory process: any high-risk feature being tested in the production environment must have a dedicated monitoring dashboard built and active before the test begins.
A Foundational Personal Learning: I had personally made the trade-off to deprioritize the monitoring and observability tickets for the MVP to meet a tight deadline. This incident was a powerful, firsthand lesson that observability is not a "nice-to-have" feature; it is a core, non-negotiable requirement for any critical system. This principle has fundamentally shaped how I approach every project I've led since.

This incident, born from a personal mistake, became a catalyst for improving our tools, our processes and my own engineering philosophy.

Proactive Ownership: A UX Fix for a +27% GMV Lift

Identified a simple user experience mismatch on our mobile homepage, proactively proposed a low-effort A/B test to a different team and drove a +27% increase in Gross Merchandise Value (GMV) from the affected user segment. This case study is a testament to the power of looking beyond assigned tasks and thinking like an owner of the entire product.

Read the full retrospective ↓

The Challenge: A Simple Observation, A Big Opportunity

While working on a backend task, I was reviewing our platform's user flow and made a simple observation. Our homepage used the same 'hero image' for all users: a person on a laptop. While this was perfectly appropriate for desktop visitors, it struck me as a subtle but significant disconnect for a user visiting our site on their phone. My hypothesis was that this 'one-size-fits-all' approach was creating a subconscious barrier, making the product feel less relevant to mobile users and potentially harming conversion.

My Role: Proactive Contributor

This was a clear example of an opportunity that fell outside my direct responsibilities and team's domain. My role was not to implement a fix, but to act as a proactive owner of the overall product experience. This meant validating my observation with data and building a compelling, low-friction proposal for the team that actually owned the homepage.

The Solution: A Data-Backed, Easy-to-Say-Yes-To Proposal

I knew that simply flagging the issue in a Slack channel would likely result in it being lost in the backlog. To drive action, I followed a three-step process:

Validate with Data: I partnered with a data analyst to confirm my hunch. We reviewed engagement metrics and confirmed that our mobile user segment indeed had a lower conversion rate compared to desktop users.
Formulate a Hypothesis: I framed my observation as a clear, testable hypothesis: "Showing a mobile-centric image to mobile users will create better resonance, leading to increased engagement and conversion."
Create a Low-Effort Proposal: I wrote a brief, one-page document for the relevant Product Manager. I didn't ask them to commit to a major roadmap change; I simply proposed a low-effort, high-potential A/B test. By doing the initial data validation and presenting a clear hypothesis, I made it as easy as possible for them to say "yes" and add it to their next sprint.

The Results: Outsized Impact from a Small Change

The homepage team was receptive and ran the A/B test. The results were immediate and exceeded all expectations. The mobile user cohort that saw a new, mobile-centric hero image demonstrated a massive improvement in key metrics:

+27% increase in Gross Merchandise Value (GMV)
+20% increase in total hours purchased

This initiative was a powerful lesson in the value of thinking like an owner. It proved that sometimes the most significant product improvements don't come from complex, multi-month engineering epics, but from a simple, user-centric observation and the initiative to see it through. It reinforced my belief that every member of a team has the ability to drive impact if they are empowered to look beyond their next ticket.

Strategic Alignment: A Lesson in "Disagree & Commit"

Identified a growing, underserved user segment (Music learners) experiencing significant friction due to product limitations. This case study details a data-backed advocacy effort to improve their experience, ultimately serving as a lesson in aligning individual insights with broader company strategy and the principle of "disagree and commit."

Read the full retrospective ↓

The Challenge: Overlooking Valuable Niche User Segments

Our platform's primary focus was language learning. However, I observed a significant and growing user base for other subjects, particularly Music. While these were not core growth areas, they consistently ranked in our top 10 most popular subjects by hours purchased. These subjects also had higher than average hourly rates, and represented substantial profitable revenue streams.

My concern was that these users faced unnecessary friction due to product limitations. For example:

Lack of Specialization Filters: Unlike languages (e.g. "Business English"), Music users couldn't search for "Guitar Tutor" or "Piano Tutor", forcing them to manually scroll through long lists and click into profiles.
Inflexible Subscription Frequencies: Many adult music learners preferred bi-weekly lessons due to practice time, but our subscription plans only offered weekly frequencies, pushing them towards less convenient options or even churn.

My hypothesis was that these overlooked user experience gaps were leading to unnecessary churn and hindering growth within these valuable niche segments.

My Role: Data-Driven Advocate

This initiative fell outside my direct team's roadmap. My role was to act as a data-driven advocate for these users, identifying the problem, quantifying the opportunity and proposing simple solutions to improve their experience.

The Solution: Building a Case for Niche Improvements

I approached this by building a clear, data-backed case:

Quantify the Opportunity: I gathered data on the total hours and GMV generated by Music subject, demonstrating their substantial contribution to the company's bottom line, despite not being a primary growth focus.
Highlight User Friction: I presented specific examples of user experience issues, backed by anecdotal feedback from customer support, illustrating how our generic platform was failing these specific users.
Propose Low-Effort Solutions: I outlined simple backend changes (e.g. adding instrument specializations, allowing bi-weekly frequency options for specific subjects) that I believed could deliver high ROI by reducing friction and improving retention in these segments.

I presented this proposal to product leadership, emphasizing the potential for "easy wins" by serving an existing, valuable user base better.

The Outcome: Strategic Alignment and "Disagree and Commit"

While the product leadership acknowledged the validity of my insights and the value of these user segments, they made a strategic decision to maintain laser-focus on core language growth. Resources were explicitly allocated away from non-core subjects to maximize impact in the primary business area.

The proposed features were not prioritized in the roadmap.

This project, while not resulting in a shipped feature, provided me with a crucial lesson in "disagree and commit." I learned that:

Advocacy is Essential, but Strategy is King: It's vital to advocate fiercely for what you believe is right, especially when backed by data.
Respect Broader Strategic Alignment: Once a strategic decision is made, even if you disagree with it, it's essential to understand the rationale and align your efforts with the broader company goals.
Professional Conduct: I documented my research and proposal in our internal knowledge base, ensuring the insights were preserved for future consideration. I then refocused my full energy on the prioritized roadmap items.

This experience reinforced the importance of strategic clarity and demonstrated my ability to contribute insights, influence discussions and ultimately commit professionally to the agreed-upon direction.

System Design

Spam Classification Engine (2025)

A technical design for a proof-of-concept spam classifier in a legacy Perl ecosystem. This document showcases a pragmatic, "white-box" heuristic approach chosen for its interpretability, maintainability and a phased evolution towards a modern ML service, demonstrating deep architectural thinking under realistic constraints.

Download Design Doc (PDF) →

Read the full document online ↓

Introduction: A Pragmatic Design for a Real-World Scenario

Designing a system to accurately identify web spam is a classic backend challenge. This document presents a technical design for a proof-of-concept (PoC) spam classifier. The goal was to solve a common, real-world engineering problem: how to build a new, critical feature within the constraints of an established, legacy ecosystem.

To simulate this scenario, the design was developed under a specific set of constraints that heavily influenced the architectural decisions:

A Legacy Tech Stack: The design assumes the target environment is a large, mature Perl monolith. This forces a focus on integration and maintainability by a team with deep expertise in that specific stack.
A Tight Timeframe for a PoC: The project was scoped for a rapid PoC, with an estimated 20-hour budget for the implementation phase. This required a design that prioritized simplicity and speed-to-delivery.
The Goal of Validation: The primary objective was to validate a classification strategy against a dataset, not to build a production-ready system from day one.

These realistic constraints led to a design that deliberately prioritizes interpretability, tunability and leveraging existing strengths over building a "perfect" solution in a theoretical vacuum.

Assumed Inputs

This design focuses exclusively on the classification engine. It assumes the prior existence of two key inputs:

A Corpus of Scraped Websites: A local directory containing the raw HTML content of various websites.
A Labeled Dataset: A simple text file that serves as the "ground truth," mapping each website's domain to a binary label (spam/not-spam).

The design also assumes this initial dataset is well-balanced and representative enough to create a useful set of initial heuristic rules. The document does not cover the implementation of the web scraper or the initial data labeling process.

Defining Success: A Trust-Centric Approach

For a PoC, success is about proving the viability of an approach against clear measurable goals, not just delivering functional code. The core value of this classifier lies in its ability to effectively identify spam while upholding a high standard of trust.

This led to a core tuning philosophy that guided all subsequent metrics and decisions: A False Positive (a legitimate site incorrectly flagged) is significantly more harmful to user trust than a False Negative (a spam site that is missed). Therefore, the entire design is deliberately optimized to ensure the integrity of the results.

Quantitative Goals

To that end, I established a clear set of success criteria:

Primary Metric (Precision > 90%): Reflecting the core philosophy, the system must be highly precise. At least 90% of the sites it flags as spam must actually be spam.
Secondary Metric (Recall > 70%): The system must still be effective, correctly identifying at least 70% of all true spam sites in the dataset.
Health Metric (Accuracy > 80%): The overall accuracy must be significantly better than random chance, serving as a general health check.

Tuning Levers

The design's configuration file would provide two main levers to achieve these metrics:

Rule Weighting: Individual rule scores would be carefully calibrated. Rules with a higher risk of producing false positives would be given more conservative (lower) scores.
Threshold Setting: The final classification threshold would be set conservatively high, directly enforcing the "precision-first" principle.

Stretch Goal: Weighted Performance

A more mature success metric, beyond the initial PoC, would be to weight the performance scores by a site's real-world impact. Misclassifying a high-traffic, popular domain is far more damaging than misclassifying an obscure one. A future version of the efficacy test would incorporate a public list like the Tranco Top 1M to give more weight to errors on popular sites, providing a better measure of the classifier's impact on the user experience.

This quantitative and philosophical approach provides a clear framework for evaluating the PoC and tuning its performance.

Core Design Philosophy: Pragmatism Within Constraints

The design is guided by a philosophy of pragmatism, focusing on delivering maximum value and minimizing risk within the defined constraints. When considering the addition of a feature to a legacy system, stability and maintainability are paramount. This led to three core principles:

Interpretability Builds Trust: For a new, critical system like a spam classifier to be adopted by an existing team, its decisions must be explainable. I chose a "white-box" heuristic approach where every classification can be traced back to a specific set of human-readable rules. This is crucial for gaining stakeholder trust and makes the system far easier to debug and maintain than a "black-box" alternative.
Tunability Enables Safe Iteration: A system with hard-coded logic is brittle and risky to modify. My design externalizes all classification rules and their associated weights into a simple YAML configuration file. This decouples the core logic from the code, allowing for safe, incremental tuning and the addition of new rules without requiring a full code deployment and its associated risks.
Leveraging the Existing Ecosystem: The most critical constraint was the mature Perl environment. Instead of fighting this, the design embraces it as a strategic advantage. It leverages Perl's renowned text-processing capabilities, which are exceptionally well-suited for a rule-based engine. Furthermore, it draws architectural inspiration from the proven, time-tested patterns of Apache SpamAssassin, a highly influential and successful open-source project in the domain of heuristic-based filtering. This approach ensures that the proposed solution would be easily understood, built and maintained by an existing Perl-focused engineering team, drastically reducing the project's real-world risk profile.

The Proposed Solution: A Configurable Heuristic Engine

The proposed solution is a modular, self-contained Perl application that functions as a configurable, weighted heuristic engine. This approach directly models the methodology used by successful, time-tested systems like Apache SpamAssassin, which combine a set of distinct rules or "signals" to compute a cumulative spam score. A site is classified as spam if its total score exceeds a predefined threshold.

The application's architecture is designed for simplicity, clarity and testability, with a linear data flow and three decoupled core components.

1. The Feature Extractor (`FeatureExtractor.pm`)

This is the data collection module. It is responsible for all I/O and parsing, with a single responsibility: to take a path to a scraped website's content and return a simple dictionary (or key-value object) of well-defined features. It has no knowledge of the spam rules themselves.

The initial set of features was inspired by common spamdexing techniques and focuses on signals that are both predictive and computationally inexpensive. They fall into several categories:

Content-Based: Keyword stuffing density, presence of hidden text (e.g. via CSS) and the text-to-HTML ratio to detect "thin content."
Link-Based: Total count of outbound links and the ratio of external-to-internal links.
URL-Based: Lexical analysis of the domain name itself, such as the count of hyphens or numbers.
Structural: Presence of tags commonly used for deceptive redirects, like <meta http-equiv="refresh">.

2. The Configurable Rules Engine (`RulesEngine.pm`)

This is the core logic of the classifier. To provide maximum flexibility and tunability, the rules are not hard-coded. Instead, they are defined in an external, human-readable rules.yaml file. This completely decouples the classification logic from the application code.

A typical rule in the configuration consists of:

Name: A unique, descriptive identifier (e.g. EXCESSIVE_OUTBOUND_LINKS).
Feature: The key from the FeatureExtractor to inspect (e.g. num_outbound_links).
Operator: The comparison to perform (e.g. GREATER_THAN).
Value: The threshold to compare against (e.g. 200).
Score: The weight (positive or negative) to add to the total spam score if the rule is triggered.

3. The Orchestrator (`classify_site.pl`)

This is the main command-line script that serves as the application's entry point. It orchestrates the entire process:

Parses the command-line arguments (e.g. the input file path).
Loads the classification threshold and all rule definitions from the rules.yaml file.
Invokes the FeatureExtractor with the input path to get the site's features.
Passes the extracted features and the loaded rules to the RulesEngine to compute a final spam score and a list of triggered rules.
Compares this score against the threshold to determine the final classification. The orchestrator then produces a final result object containing the classification (spam/not-spam), the total score and the list of specific rules that were triggered, providing full explainability for the decision.

Deep Dive: The Strategic Decision to Defer Machine Learning

The most obvious alternative to a heuristic engine is a Machine Learning (ML) classification model. While an ML approach offers high potential accuracy, I made a deliberate, strategic decision to reject it for the initial PoC phase. This was a pragmatic choice driven by the project's specific constraints: a tight timeframe and the context of a legacy Perl ecosystem.

The key trade-offs I considered were:

1. Expanding on the "Interpretability Builds Trust" Principle

As stated in the core design philosophy, a "white-box" solution is crucial for building trust. The deep dive reveals why this is so critical in practice:

For Debugging: A heuristic approach is fully traceable. If a site is misclassified, we can see the exact list of rules that were triggered and their scores. This makes identifying and fixing a faulty rule a simple and deterministic process.
For Stakeholder Buy-in: When demonstrating the PoC to product or business stakeholders, being able to explain precisely why a site was flagged (e.g. "it has an excessive number of outbound links and hidden text") is far more convincing than saying "the model's prediction was 0.92."
For Team Adoption: For the existing engineering team that would inherit the system, the transparent logic of a heuristic engine dramatically lowers the maintenance burden and empowers them to contribute new rules with confidence.

2. Implementation Feasibility vs. Ecosystem Mismatch

The ~20-hour implementation budget and the Perl technology stack made a full ML workflow infeasible and unwise.

A responsible ML workflow requires a significant investment in data pipelines, feature scaling, cross-validation, hyperparameter tuning and systematic error analysis. Attempting to rush these steps would produce an untrustworthy model.
The Perl ecosystem, while powerful for text processing, lacks the mature, world-class ML libraries and tooling of a language like Python. Building a robust ML pipeline in this environment would require significant foundational work, placing it well outside the scope of a rapid PoC.

3. Robustness in a Low-Data Environment

A PoC typically starts with a limited dataset. In this scenario, a complex ML model is at high risk of "overfitting" (learning spurious correlations that do not generalize to new websites). A carefully crafted heuristic system, based on the established first principles of web spam (as documented in sources like Wikipedia and academic research), is often more robust and performs more predictably when data is scarce.

In summary, the decision to defer ML was a conscious, pragmatic trade-off. It prioritized speed-to-learning, interpretability and a low-risk integration path into the existing ecosystem, the most important factors for a successful PoC.

A Multi-Layered Testing Strategy

A classifier is only as good as the confidence we have in its results. Therefore, the design includes a comprehensive, multi-layered testing strategy to ensure correctness, robustness and performance.

Unit Tests (Correctness): These tests would validate the individual components in isolation. For example, we would provide the FeatureExtractor with small, self-contained HTML strings to assert that it correctly counts links and calculates ratios. We would provide the RulesEngine with hardcoded feature dictionaries (key-value objects) to assert that the score calculation is mathematically correct.
Integration Tests (Cohesion): These would test the entire application flow, from command-line argument parsing to the final output. By running the main classify_site.pl script against temporary sample files, we could assert that the components work together as expected.
Efficacy / Acceptance Testing (Performance): This is the most critical layer to evaluate the quality of the results. The design includes a dedicated evaluation function that runs the classifier against the entire labeled training dataset. It compares the system's predictions using the ground-truth labels and computes the final Accuracy, Precision and Recall metrics. This provides the quantitative feedback loop essential for tuning.
Adversarial Testing (Robustness): To ensure the system is resilient against real-world, messy data and adversarial attacks, the design includes a strategy for adversarial testing. This would involve a suite of tests that programmatically generate synthetic, challenging test cases:
- Spam Injection: This test would take known "good" sites from the training set and automatically inject spam signals (e.g. adding 100 instances of a keyword in a <div> with style='display:none;'). The test would then assert that the site's spam score increases as expected.
- Evasion Testing: This test would take known "spam" sites and attempt to break the classifier. For example, it could programmatically mutate the HTML by removing random closing tags or adding malformed attributes, asserting that the application does not crash and still makes the correct classification.

The Path to Production: From Prototype to Integrated Service

The PoC is designed as a standalone CLI tool for safe, offline validation. The journey to a robust, production-grade system involves a pragmatic, phased integration into the existing legacy monolith and a strategic evolution of the technology stack.

Phase 1: Asynchronous Integration & Performance Optimization: The first step is to wrap the classifier's logic into an asynchronous worker service. A separate crawling system would be responsible for fetching website content and storing it. The crawler (or another service) would then place a message on a queue containing a pointer to this stored content. The worker would consume these messages, fetch the content, perform the classification and store the result. This is a low-risk integration pattern that prevents the classifier from adding any latency to user-facing requests.
Phase 2: Building a Dynamic Data Pipeline: The static training set would be replaced by a dynamic data pipeline that continuously scrapes and refreshes website data. To allocate resources efficiently, this pipeline would prioritize refreshing popular, high-traffic domains more frequently than less common ones, ensuring the classifier's data is freshest where it matters most.
Phase 3: Strategic Evolution to a Python/ML Service: Once the heuristic engine is proven and a rich dataset has been collected, the next logical step is to introduce Machine Learning. At this point, it becomes a sound architectural decision to build a new, dedicated V2 service in Python to leverage its world-class ML ecosystem (scikit-learn, pandas, etc.).
- This new Python service would initially consume the same features from the FeatureExtractor, allowing for a direct, apples-to-apples comparison between the heuristic model and the new ML model.
- This phased approach allows the organization to strategically introduce a new technology (Python) for a specific, high-value problem, rather than attempting a risky, large-scale rewrite.

Production Optimizations

As the system moves to a high-volume production environment, several optimizations would be implemented:

Early Exit (Short-Circuiting): After the initial rule set is validated, the engine would be modified to stop processing rules as soon as a site's cumulative score exceeds the threshold. During the PoC phase, this is intentionally disabled to ensure we collect data on all triggered rules for better analysis.
Concurrency Model: The design is inherently efficient, as the classification process is primarily I/O-bound (reading files) rather than CPU-bound (the heuristic checks are computationally cheap). This means a single worker instance could be made more efficient by using an event-driven, non-blocking I/O model to process multiple sites concurrently, maximizing throughput.

Evolving the Product Capabilities

Beyond the technical migration, the architecture allows for significant enhancements over time:

Richer Heuristics: The modular FeatureExtractor is designed for extension. While the PoC focuses on simple, static HTML analysis, a production version would be enhanced to extract more sophisticated signals, such as detecting obfuscated text or analyzing the behavior of embedded scripts, to combat more advanced spam techniques.
Automated Rule Tuning: With a dynamic data pipeline in place, we could implement a workflow for automatically tuning the rule weights (or model re-training) based on the latest data, ensuring the classifier continuously adapts to new spammer tactics.
Reputation History: The system could begin tracking a domain's classification over time. A site that frequently flips between spam/not-spam could incur a reputation penalty, making it harder for adversarial actors to game the system.
Finer-Grained Classification: While the PoC may operate on a pre-aggregated "site" level, the underlying FeatureExtractor works on a page-by-page basis. A major evolution would be to expose this page-level classification to handle large platforms with user-generated content (e.g. github.io). This would allow the system to flag a single malicious page without penalizing the entire legitimate domain.

This evolutionary path, encompassing both the technical stack and the product's capabilities, allows for a gradual, de-risked migration from a simple Perl prototype to a sophisticated, Python-based ML service. It ensures that value is delivered and validated at every stage, prioritizing long-term stability and effectiveness.

Reimbursement API (2017)

Design and implementation of a healthcare reimbursement system.

Architecture: Demonstrates a clean microservice architecture using containerized Python and PostgreSQL services orchestrated with Docker. The design follows REST principles and isolates concerns for scalability.
Strategic Decisions: The accompanying documentation outlines clear assumptions, API design with versioning considerations, error handling strategies and a thoughtful discussion on performance trade-offs, showcasing a mature approach to software development.

View on GitHub →

URL Shortener (2012)

A high-performance URL shortener, implemented in under 100 lines of Python for Google App Engine.

Architecture: The solution uses base64-encoded database IDs to generate minimal-length URLs and employs two caching strategies to achieve near-zero database reads for repeat lookups.
Scalability: The design document includes a detailed analysis of long-term scaling, covering CDN usage, database sharding, consistent hashing and other production-grade considerations, demonstrating foresight beyond the initial implementation.

View on GitHub →

Personal & Open Source Projects

netflix-to-srt

View on GitHub →

This popular open-source tool (800+ stars) provides a robust solution for converting Netflix subtitle files into the standard SRT format. It was created to solve a personal need that the community also shared, demonstrating the ability to identify a gap and build a widely adopted tool.

Dual Interface: Offers both a powerful Python CLI for batch processing and automation and a user-friendly, client-side JavaScript web interface for single-file conversions.
Community Driven: The project has grown to include multiple methods for acquiring subtitle files, timestamp shifting capabilities and has received contributions from the community.

This Homepage

View on GitHub →

This portfolio site itself is a testament to my engineering philosophy. It's a static site built without a heavy framework, using a custom Node.js script for a fully controlled and transparent build process.

Architecture: Employs a "headless" approach, with all narrative content authored in Markdown for easy maintenance. The build process features parallelized tasks and per-page CSS loading to keep the homepage's render path exceptionally fast.
Discipline: The entire Git history follows the Conventional Commits specification, ensuring a clean, readable and automated changelog.
Quality: Focuses on performance, accessibility (WCAG compliant) and maintainability, as detailed in the project's README.

Basepaint Media Pipeline

An archival project for the generative art community basepaint.xyz. This project features a pipeline of Python scripts to fetch, process and enrich artwork metadata using LLMs. The final output is a series of high-quality, printable PDFs containing the art, creation stats and AI-generated descriptions of each piece, making the collection accessible and well-documented.

View on GitHub →

Rhythm Radar (2016)

An experimental real-time rhythm visualizer built with D3.js. This proof-of-concept uses polar coordinates to create an intuitive "radar" display for drum patterns, offering a novel alternative to traditional linear notation. The project was inspired by a desire to to understand complex rhythms in real time, especially when multiple instruments and loops are involved.

View on GitHub →

Docker Remote Debugging

A practical guide and working example for debugging Python code remotely within a Docker container. This project provides a clear, step-by-step solution using pudb and telnet, complete with sample code and configuration, to improve development workflows in containerized environments.

View on GitHub →

Moon Cycle Data Analysis

A data-centric project providing daily moon phase data from 1800-2050 in a clean CSV format. The repository includes two Python scripts: one using the ephem library for high-accuracy calculations and a dependency-free version for faster, less precise estimates. This demonstrates an understanding of both scientific accuracy and practical performance trade-offs.

View on GitHub →

Tinymem

A Simon-inspired memory game for the Thumby, a keychain-sized programmable console.

Technology: Written in under 50 lines of MicroPython, this project demonstrates resource-constrained development, making use of the device's audio, sprites and input buttons.
Purpose: Served as a proof of concept for a complete, playable game with a tiny code footprint. It was the subject of a presentation at PyDayBCN 2023.

View on GitHub →

Pycrastinate (2014)

A language-agnostic tool for managing TODO and FIXME comments across entire codebases, presented in full at PyCon Sweden 2014 and abridged at EuroPython 2014.

Architecture: Built as a configurable pipeline, the tool finds tagged comments, enriches them with git blame metadata (author, date) and generates reports.
Purpose: This early project was an exploration into building developer tooling and demonstrates an early interest in code quality and maintainability.

View on GitHub →

Presentations

From 1+ Billion years to less than 1 second

Presented at PyCon Sweden, this talk demonstrates a >10¹⁸x performance improvement using 10+ techniques for code optimization, with performance comparisons between Python, PyPy, and C++.

View all presentations on GitHub →

Education & Interests

M.Sc. from NTNU/UPC, fluent in English and active in community leadership.

M.Sc. in Informatics Engineering, Highest Honours

Norwegian University of Science and Technology (NTNU) & Universitat Politècnica de Catalunya (UPC), 2011

Completed a 5-year, 300+ ECTS program in Informatics Engineering at UPC, with specializations in Data Management and Software Engineering.
Authored a Master's Thesis on Test-Driven Conceptual Modelling at NTNU as part of the ERASMUS programme. This received the highest possible grade (A with Highest Honours) from both institutions.

Languages

Native: Catalan, Spanish
Full Proficiency: English (10+ years professional experience; Cambridge CPE/C2)
Conversational: Swedish (6 years residency; A2 certified)
Basic: Norwegian (EILC Bokmål course)

Leadership & Interests

Community Leadership (VP & Treasurer, 2006-2010): Co-managed a non-profit cultural association aimed at providing positive leisure alternatives for local youth. Oversaw budgets, secured public funding and organized community events, including annual LAN parties and boardgame weekends.
Personal Interests: I enjoy classical music, performing on piano, harpsichord and participating in organ masterclasses. I'm also an avid reader and fond of indie games, having developed PoCs for Thumby like a Simon clone and a Pong clone.

Credentials

Degrees, diplomas and language certificates are available for review on GitHub.

Isaac Bernat

Experience

Staff Backend Engineer

Senior Backend Engineer

Fullstack Engineer

Backend Engineer

Tech Lead & Founding Engineer (2.0 Pivot)

Early Engineer (Backend, Data, Frontend)

Case Studies

Model MVP: Building a Company-Defining Success (2021)

The Challenge: Unpredictable Revenue and High User Friction

Defining Success: An MVP-First, Data-Driven Approach

The Solution: Pragmatic Scoping and Product Decisions

1. Strategic Billing Cycle

2. Aggressive MVP Feature Scoping

3. Strategic Plan Sizing

4. A Carefully Restricted Audience

Delivery & Iteration: Parallel Cohorts

Deep Dive: The Critical Architectural Decision

The Results: A Company-Defining Success

Key Personal Learnings

Subscription Optimization: A Crisis Turned into Best Experiment (2023)

The Challenge: A "Leaky Bucket" in Our Subscription Model

Uncovering a Hidden Liability

My Role: Epic Lead and Backend Architect

The Solution: A Two-Part Strategy

Ensuring a Scientifically Valid Test

Deep Dive: A High-Stakes Decision Based on Incomplete Data

The Results: The Most Impactful Experiment of the Year

The Challenge: The "Obvious" Next Step

The Technical Approach: A Mistake in Foresight

The Outcome: A Failed Experiment and Valuable Lessons

Learning 1: Validate the Business Case Before Building

Learning 2: The True Meaning of YAGNI

The Challenge: An Inefficient and Expensive CI Pipeline

My Role: From Idea to Impact

The Solution: A Data-Driven, Consensus-First Approach

The Results: Immediate and Measurable Savings

The Challenge: A Pre-Launch SEV-2 Incident (Feb 2021)

My Role: Incident Owner and Scribe

The Response: A Methodical Approach Under Pressure

The Outcome: Systemic Improvement from a Personal Mistake

The Challenge: A Simple Observation, A Big Opportunity

My Role: Proactive Contributor

The Solution: A Data-Backed, Easy-to-Say-Yes-To Proposal

The Results: Outsized Impact from a Small Change

The Challenge: Overlooking Valuable Niche User Segments

My Role: Data-Driven Advocate

The Solution: Building a Case for Niche Improvements

The Outcome: Strategic Alignment and "Disagree and Commit"

System Design

Spam Classification Engine (2025)

Introduction: A Pragmatic Design for a Real-World Scenario

Assumed Inputs

Defining Success: A Trust-Centric Approach

Quantitative Goals

Tuning Levers

Stretch Goal: Weighted Performance

Core Design Philosophy: Pragmatism Within Constraints

The Proposed Solution: A Configurable Heuristic Engine

1. The Feature Extractor (FeatureExtractor.pm)

2. The Configurable Rules Engine (RulesEngine.pm)

3. The Orchestrator (classify_site.pl)

Deep Dive: The Strategic Decision to Defer Machine Learning

1. Expanding on the "Interpretability Builds Trust" Principle

2. Implementation Feasibility vs. Ecosystem Mismatch

3. Robustness in a Low-Data Environment

A Multi-Layered Testing Strategy

The Path to Production: From Prototype to Integrated Service

Production Optimizations

Evolving the Product Capabilities

Reimbursement API (2017)

URL Shortener (2012)

Personal & Open Source Projects

netflix-to-srt

This Homepage

Basepaint Media Pipeline

Rhythm Radar (2016)

Docker Remote Debugging

Moon Cycle Data Analysis

1. The Feature Extractor (`FeatureExtractor.pm`)

2. The Configurable Rules Engine (`RulesEngine.pm`)

3. The Orchestrator (`classify_site.pl`)