Product

TUXX as field proof

Services as proof for product ideas

The question that actually matters in October 2023

Not "what can we build?" That question has been too easy to answer for too long. The tooling is capable. The models are capable. The constraint has moved.

The useful question right now is: where is the friction repeating?

Repeated friction is not a complaint. It is a signal, one of the more reliable ones available if you are close enough to real work to observe it. It tells you where teams lose momentum, where people quietly abandon standards because the standard is too expensive to maintain, and where a better-designed system would change the outcome without requiring anyone to work harder. Friction that recurs across clients, across contexts, across people with different skill levels, that friction is pointing at something structural. Something worth building for.

TUXX exists, in large part, to stay close to that signal.

What the services function actually does

There is a version of a services company that exists to extract margin from repeatable executions. A team deploys a known solution, bills the hours, moves on. That is one model. It is not TUXX's model.

The more accurate framing is that TUXX operates as a field environment. When a custom system is built for a client, a workflow tool, an internal assistant, a process layer that sits between people and their data, the constraints become real in ways that desk-level thinking cannot anticipate. The edge cases are genuine edge cases, not imagined ones. The user resistance is genuine user resistance. The failure modes are instructive because they cost something.

What gets learned in that environment feeds back. Not just as notes, not just as "learnings" in the consulting-agency sense, but as structural insight into where a product layer would hold and where it would not. Every TUXX engagement is, among other things, a live test of a thesis about what software should do and how people actually operate inside it.

This is the field proof function. You do not get it from building in isolation. You do not get it from user research alone. You get it from shipping something real, watching it operate under pressure, and paying close enough attention to understand why certain things worked and why others did not.

The gap between a well-reasoned product spec and a system that survives contact with a client's actual operating environment is, almost always, larger than anyone anticipated. TUXX is what closes that gap, deliberately, rather than accidentally.

The commercial validation function

There is a specific question embedded in the field proof concept that is worth drawing out: if TUXX can deliver it for a client, can Orbit package it for a wider market?

This is the commercial validation function, and it is distinct from simply building things that clients will pay for. It is about understanding which elements of a custom engagement are genuinely custom, specific to that client's context, their data, their people, their existing systems, and which elements are structural patterns that will recur across businesses operating in similar conditions. The former stays in services. The latter is the raw material for product.

Getting this distinction right is harder than it sounds. The temptation, when a client engagement goes well, is to conclude that the thing built was the insight. It rarely is. The thing built is the vehicle for discovering the insight. The insight is in the pattern it revealed: the part of the process that was load-bearing, the decision point that kept surfacing, the moment where the handoff between human judgement and automated action turned out to matter most.

That is what TUXX is actually hunting for on each engagement. Not the deliverable but the underlying structure that the deliverable made visible. That structure, when it shows up reliably enough, is what Orbit is eventually built to handle at scale.

The loop between services and product

Orbit is the product surface that commercial execution eventually lands on. The ambition is a full operating system for lead-to-launched-product workflows, not a CRM, not a project management tool, not an AI wrapper bolted onto a spreadsheet, but a system that holds the entire commercial process as a coherent surface that a business can actually run on.

To build that credibly, you have to know which parts of that surface are genuinely hard to design well. You have to know where people will resist automation because the decision genuinely requires judgement, and where they embrace it because the task was mechanical and they resented doing it manually. You have to know what information a person actually needs at each stage, as opposed to what a product designer might assume they need.

TUXX generates that knowledge in live conditions. The insight loop runs between services and product: TUXX tests something under real constraints, the patterns become visible, Orbit is built to handle those patterns at scale. The services arm keeps the product honest. It ensures that what is being built in product is not a theory about how commercial work operates but a response to how it actually operates.

This is not a novel arrangement. Firms that have both a consulting practice and a software product often find the consulting arm quietly doing this work: identifying what is genuinely repeatable, what has genuine leverage, what a paying customer will actually use. The arrangement only works, though, if the people doing the services work are paying the right kind of attention. Attention to patterns, not just to deliverables.

Pattern Up as crystallised method

Pattern Up, the TUXX sub-product, is the place where that attention begins to crystallise. Where a pattern has been observed often enough, and understood well enough, to be worth naming and structuring: that is where a TUXX-built method becomes a repeatable thing that can be handed to a client team rather than rebuilt from scratch on each engagement.

This matters because the goal of the services function is not to stay in services indefinitely. The goal is to move what is repeatable into product, retain what requires genuine custom work in services, and continuously sharpen the line between the two. Pattern Up sits at that line. It is the point at which field-tested method becomes something packageable.

This is also where the distinction between Orbit and TUXX becomes clearest in practice. TUXX builds for specific contexts, with specific constraints, for specific people. Orbit builds for a wider market, for businesses that share the same underlying commercial problem even if the surface details differ. The movement from TUXX engagement to Orbit feature is a movement from specificity to generalisability. It is a meaningful step, and it requires honest judgement about what actually transfers.

Not everything transfers. Some of the most effective work TUXX does in a client environment is effective precisely because it is bespoke, shaped around people, process, and context that no generalised product could anticipate. That is fine. The goal is not to productise everything. The goal is to productise the things that should be productised, and to stay clear-eyed about what should not.

What the models are changing about this dynamic

In October 2023, the model landscape is evolving fast enough that the field proof function has become even more critical than it was twelve months ago. Not because the models are harder to work with; they are becoming easier. But because the surrounding product design question is becoming harder.

A capable model can answer. It can retrieve. It can synthesise. It can draft. What it cannot do, without deliberate product architecture around it, is direct its own attention appropriately, remember what matters across sessions, measure whether its outputs are actually moving something forward, recover gracefully from its own errors, and help a person reach the end of something rather than just the middle of it.

Those are product problems. They have always been product problems. But the existence of capable models has made them more visible, because now the gap between "a model that can do something impressive in a demo" and "a system that improves commercial outcomes over time" is almost entirely the product gap. Closing that gap requires knowing, with some specificity, what the operating problems actually are.

TUXX is one of the places that knowledge gets built. Benediction Lab is another, operating at the research layer, where the questions are about agents, memory systems, and the underlying architecture of how AI-powered systems can sustain reliable behaviour. The two arms are pointing at the same set of problems from different positions: services from the applied end, research from the structural end. When the two perspectives converge on the same problem, that is usually a signal worth taking seriously.

On the friction signal specifically

It is worth being precise about what "repeated friction" actually means in this context, because it is easy to mistake it for volume of complaint.

The friction signal is not that clients find something difficult. Some things are difficult because the underlying problem is difficult, and making it easier would require solving the underlying problem, which may be outside the scope of any tool. That kind of difficulty is not a product opportunity; it is a signal that expectations need calibrating.

The friction signal that is worth building for is where a task is genuinely mechanical, where the person doing it is not adding judgement, only effort, and the current system makes it expensive anyway. Or where a decision that requires human judgement is being made on inadequate information, not because the information is unavailable, but because no system has been designed to surface it at the right moment.

A third version also exists and is easy to overlook: where the system was designed to help but creates its own overhead: prompting the user at the wrong moment, surfacing information that is accurate but not actionable, or solving the stated problem while missing the actual one. Systems that generate this kind of friction are often well-intentioned and well-executed within the terms of their original spec. The spec was just wrong. Field proof is what reveals this, because the real environment doesn't respect the spec.

Those patterns recur across clients, across industries, across contexts. When they do, they are pointing at something that can be systematically improved rather than managed around. That is where TUXX's attention goes, and where the products eventually go in response.

Building the institution

The reason this framing matters, services as field proof, patterns as the currency, product as the endpoint, is that it is what distinguishes building an institution from running a project.

A project builds a thing. An institution develops the capacity to build the right things, over time, with progressively greater precision. The capacity comes from the feedback loop: real work generating real insight, insight shaping product direction, product going back into the field to be tested again.

The Mustard Seed Group portfolio is structured around that loop. Not as a theory but as a practical arrangement, with each part of the portfolio doing the work that its position in the system makes it best suited for. TUXX in the field. Benediction Lab at the research layer. Orbit at the product surface. The consumer side, through All Purpose and CheekyGains, testing what human capability systems look like in a different domain entirely, where the constraints are different, the users are different, but the underlying question is the same: how do you build a system that actually improves what a person can do?

The loop is not self-executing. It requires attention, discipline, and honesty about what the field is actually telling you versus what you would prefer to hear. Confirmation bias is as much a risk in services as anywhere else: it is possible to run engagements and see only the evidence that fits the thesis already held. The corrective is specificity: forcing yourself to articulate not just "it worked" or "it didn't work" but precisely which part worked, under which conditions, for which reasons, and what would have to change for the pattern to break.

What October 2023 is mostly about, from the inside, is keeping that discipline active. Staying close enough to the real friction to hear it properly, and moving deliberately enough that what gets built is a genuine response to it rather than a rationalised version of what was always going to be built.

The product question in October 2023 is still "where is the friction repeating?" The answer changes. The question holds.