Performance

Performance standards and accountability

Standards as a performance interface

The two halves that need each other

There is a particular kind of organisational dysfunction that most managers recognise but few name precisely. It looks like this: a team sets a goal, the goal goes unmet, and nothing happens. Not a conversation. Not an investigation. Not a recalibration. Simply: the goal is quietly updated downward for the next cycle, or restated with fresh vocabulary, or replaced entirely with something that has not yet had a chance to fail.

This is what happens when you have standards without accountability.

The equally common mirror image is subtler and, in its own way, more demoralising. Someone is held accountable, called in, reviewed, assessed, but the standard being applied was never made clear to them in the first place. The review happens. Judgements are delivered. The person leaves the room uncertain what they should have done differently, because the target was always somewhat implied rather than stated.

This is what happens when you have accountability without standards.

Neither condition produces performance. Standards without accountability are aspirational at best, delusional at worst. Accountability without standards is theatre: the performance of seriousness without the substance of it. Getting both right, at the same time, in a system that actually functions, turns out to be one of the harder design problems in institutional life.

What a standard actually is

The word "standard" gets used loosely enough that it is worth pinning down.

A standard is not a wish. It is not a value statement. It is not a mission. It is a specific, observable description of what acceptable performance looks like, stated in terms that are clear enough for two people who disagree to eventually agree on whether the standard was met.

That last clause matters more than people typically acknowledge. The test of a good standard is not whether the person who set it knows what they meant, they almost always do, but whether the person being held to it can reconstruct that meaning independently. Most performance standards fail this test. They are written in language that is legible only to the author, and the author assumes that clarity is shared when it is not.

This is partly a laziness problem. Writing a genuinely precise standard takes longer than writing a vague one. It requires thought about edge cases, about what exactly counts as hitting the mark versus nearly hitting it, about the difference between a reasonable miss and an unreasonable one. It is easier to write "achieve excellent client outcomes" than to define, concretely, what distinguishes an excellent outcome from a mediocre one in your specific context.

But it is also a discomfort problem. Precise standards make accountability unavoidable. If the standard is vague, disagreement about whether it was met can be absorbed into interpretation. If the standard is precise, the question of whether someone cleared the bar becomes harder to avoid. Some organisations prefer the wiggle room.

The three things accountability requires

Strip away the jargon and the management literature and the coaching frameworks, and accountability in practice requires exactly three things.

First, someone who cares. Accountability does not self-execute. It requires a person, a manager, a coach, a peer, a system that represents someone, who notices when a standard is not met and considers that noticeable. This sounds obvious. It is not. In many environments the caring has been so diluted across layers of hierarchy, or so smothered under competing priorities, that missed standards pass without comment not because no one is paying attention but because noticing has been implicitly decoupled from the authority or incentive to act on what is noticed.

Second, a standard that is clear. We have already spent time here. The point is simply that clarity is not a nicety: it is a precondition. You cannot honestly hold someone to account against a standard they could not have reconstructed without asking you for clarification.

Third, a review that is honest. This is the step most often compromised. The review, whether it is a weekly check-in, a monthly performance conversation, a real-time coaching note, needs to be accurate. Not kind, exactly, but honest in the sense of being calibrated to reality rather than calibrated to comfort. A review that consistently softens the truth to avoid awkwardness is not a review. It is a social ritual, and both parties usually know it.

These three things form a system. Remove any one of them and the other two cannot compensate. You can have a reviewer who cares deeply and standards written with precision, but if the reviews are dishonest the loop never closes. You can have honest reviews and someone who genuinely cares, but if the standard is unclear the honesty has nothing to be honest about. All three, consistently, is what a functioning accountability system actually requires.

Why systems outlast motivation

The performance conversation tends to focus on motivation: how to kindle it, how to sustain it, how to manage the inevitable dips. This is not without value. But it tends to obscure a more foundational point: motivation rises and falls in every person, in every culture, under every leadership style. The question is not how to sustain peak motivation permanently, because you cannot. The question is what the system does when motivation is not at its peak.

A well-designed performance system does not depend on the person being maximally motivated to function. It makes the right behaviour easier to choose, easier to measure and easier to return to after a lapse. It creates conditions in which someone who is going through a difficult period can still see where they are relative to the standard, understand what the gap looks like, and take a clear next step without having to reconstruct the whole logic from scratch each time.

This is what distinguishes a system from a motivational intervention. A motivational intervention works while the energy is present. A system works, or at least supports the work, when the energy is lower, because the clarity and the structure are still there holding the shape.

This distinction has practical consequences for how performance products should be built. A product that only works well when the user is already performing is not a performance product. It is a logging tool for people who do not need help. The harder and more valuable problem is designing something that holds the shape of the standard clearly enough, and makes the review honest enough, that it is genuinely useful in the moments when performance is slipping.

The design problem this creates

If you are building a product that is supposed to support human performance, not just track it, but actively support it, you run into a design problem that does not resolve neatly.

The problem is roughly this: personalisation and precision are in partial tension. A standard that is precise enough to generate honest accountability tends to be specific to a context, a person, a history. The standard for what constitutes a good training week for one person is not the same as the standard for another. But a system that attempts to generate personalised standards for every user at every point in their journey is a significantly harder product to build than one that offers a fixed set of well-defined benchmarks.

The softer approach, offering general guidance, encouraging users to set their own targets, letting the product adapt to whatever the user says they want, sidesteps this tension by effectively outsourcing the standard-setting to the user. This is sometimes the right call. But it can also become a way of building a product that never develops the accountability function at all, because the standards are always exactly what the user chose, which means missed standards are always a choice the user made rather than a gap the product surfaces.

The more useful product, harder to build, richer in value, holds a position. It brings some considered view of what the standard should be, makes that view visible, and helps the user understand where they are relative to it with enough honesty that the review is not simply a mirror reflecting whatever the user wants to see.

Where Naira fits into this

Naira is the AI performance coach inside CheekyGains. The design question it is working through is precisely the one described above: how to be a coach that holds a position, not simply a companion that adapts to whatever the user presents.

The coaching problem in fitness and performance is well established. People know roughly what they should do. They know the standards, in terms of training frequency, sleep, consistency across a long enough period, often better than they can articulate. The gap between knowing and doing is not primarily an information problem. It is an accountability problem. The standard exists, at least in latent form. What is missing is a review that is honest, and someone, or something, that cares enough to notice when it is not met.

Language and memory systems make this more tractable in 2018 than it was five years ago. A coaching system with enough context about a user's history can notice patterns that the user themselves may not see clearly. It can ask a more specific question than a generic one. It can make the gap between current behaviour and the stated standard visible without requiring the user to compute that gap themselves.

But the underlying design principle is the same regardless of the technology layer. The system needs to care. The standard needs to be clear. The review needs to be honest. A language model that reflects the user back to themselves without ever surfacing the gap is not a coach. It is a more sophisticated mirror.

The product standard Naira is working toward is simpler to state than it is to implement: be precise, not noisy. Know when to challenge and when to simplify. Increase agency rather than create dependency. If the product makes someone more capable of holding themselves to their own standard over time, it has succeeded. If it makes someone reliant on the product to function at all, it has failed, regardless of the engagement metrics.

Getting the system to close

The most common performance system failure is not bad standards or dishonest reviews in isolation. It is an open loop. Someone misses a standard. The review acknowledges it. Nothing changes. The cycle repeats.

Accountability systems close the loop. They create a clear line between the observation, that the standard was not met, and some consequential next step, whether that is a changed behaviour, a support intervention, a recalibrated target, or occasionally a harder conversation. The consequence does not need to be punitive. But it needs to be something. An open loop, repeated often enough, signals to everyone in the system that the standard was not a standard at all: it was a preference, and preferences are optional.

This is true whether the system is a team, an organisation, a coaching product, or a personal practice. The mechanics are the same. The difficulty scales with complexity, but the underlying requirement does not change.

Standards without accountability are aspirational. Accountability without standards is theatre. Getting both right, in a system that actually closes, is the work.