How I'm Thinking About Percepta
Healthcare lens throughout, but most of this generalizes.
Three premises I’m treating as given
-
PR1: Models will keep getting better and the improvement is compounding. Scaling laws have delivered since 2012, and now models are being used to improve the training of the next generation, so the curve feeds on itself. METR task duration doubles every 4 months. The thing that’s hard to internalize: plan for a world where the model is meaningfully better in 12 months and dramatically better in 24.
-
PR2: People and organizations will defer more and more decisions to models. Already happening in consumer (search, writing, coding), beginning in enterprise. As models get more capable the surface area of delegation expands. The bottleneck shifts from “can the model do this” to “can the organization hand this off.”
-
PR3: Two problems stay hard and get harder as models improve:
-
Observability. When an agent takes an action you need to know what it did, why, and whether the outcome was correct. UI/UX for making agent behavior legible to humans. Plotting multi-agent trajectories when the state space explodes. More powerful models take more actions across more domains, which means more to watch. The verification inversion: checking the work gets harder as the worker gets smarter.
-
Verifiability conversion. Most of what organizations do is not structured in a way an AI can execute and a human can verify the result. The work is messy, implicit, embedded in human judgment and social context. How do you take a process that currently lives in someone’s head (or in the space between three people’s heads) and convert it into something where you can check whether the AI did it right? How do you get the human in the loop to actually verify, not rubber-stamp, but meaningfully confirm? I think this is the binding constraint right now. Whether anyone can confirm the work was done correctly is what’s actually gating deployment.
-
-
If these premises hold, the company that matters most is the one that solves the third problem. Not the best model (those commoditize) and not the best vertical tool (delegation moves faster than any single tool). The one that can make organizations legible and their processes verifiable. I could be wrong about this being the binding constraint. Maybe the real bottleneck is organizational identity, political resistance, or just that institutions move on generational timescales regardless of what’s technically possible. But I’m betting it’s verifiability.
Percepta is distinct from the labs
-
Intelligence trains centrally but deploys at the edge. Context degrades the further it travels from where it was generated. Running a frontier model from the cloud to manage a hospital’s billing exceptions is wasteful for the same reason you wouldn’t stream HD to a cloud for a security camera when an edge chip does it at 25x less energy. Percepta is the lab’s edge in healthcare.
-
Anthropic sells tokens, Percepta minimizes token burn per outcome. Percepta generates domain-specific training data from trusted deployments that Anthropic can’t get from API usage logs. It comes from inside institutions, with consent, in structured form. That data may be worth more to Anthropic than the token revenue.
-
I’m less sure about whether this arrangement is permanent. Labs are already hiring forward-deployed teams, building enterprise deployment capability. McGrew (ex-OpenAI head of research) says each enterprise integration problem is “too small” for a frontier lab, but if it gets lucrative enough, do they come for it anyway? I’m betting the lab business model makes granular institutional work permanently unattractive. Honest that it’s a bet.
-
One version of where this goes looks like the OpenAI/Thrive deal. Lab researchers embedded in healthcare deployments, producing training data they can’t get elsewhere, Percepta getting capabilities nobody else has. Alma matters here too. Deploying inside systems GC owns means you observe context at the source instead of negotiating for access.
Percepta is not a holding company
-
The holdcos (LLMH, Thrive, Sequence) buy businesses first, then try to transform them. Many of them have patient capital structures, that’s the whole thesis, holding these businesses long-term with capital designed for it. The operational improvements are real (tasks that took 10 hours now take 1, margins genuinely expand). Does the market eventually reclassify an AI-transformed services company, or does the services label stick? If margins go from 10% to 40%, at what point does the market treat it differently? The holdcos haven’t answered that. The core difference from Percepta is the sequence, not the capital structure. They have to pick targets before they deeply understand what’s transformable.
-
Percepta does it in reverse. Deploy into organizations, build the transformation toolkit, map what’s actually possible, THEN GC can decide what to buy. Percepta has information the holdcos don’t have at the point of acquisition. Transform first, own what you’ve converted. Does having the deepest understanding of a business make you the natural owner or just the most informed advisor? At minimum it means you have information nobody else has when it matters.
-
I keep coming back to this optionality: Percepta can become a holdco in the future, but it gets to build the engine meticulously first. Try things out, learn what works before buying anything. The holdcos are doing the reverse and are forced to be rushed where Percepta can be methodical.
-
GC’s capital leverage makes this possible. GC doesn’t have permanent capital in the Berkshire sense, but it has the scale and fund structure to make bets that pure venture can’t. Like buying a hospital. Partnering with major acquirers to deploy into enterprises going private removes quarterly earnings pressure entirely.
-
Rollups automate to cut costs, which cannibalizes the labor revenue they bought. Percepta should be doing something different. Not making the hospital cheaper but making it capable of things that weren’t possible before at any price. I keep thinking of it as an exoskeleton rather than a replacement.
Percepta is not a typical AI startup
-
Most AI startups converge on the same idea: build a vertical tool and compete for enterprise attention. Meanwhile the labs eat the bigger horizontal problems from above and capabilities double on 12-18 month cycles.
-
There’s a black hole effect happening underneath this. Every vertical AI app, regardless of industry, is being pulled toward the same center of gravity: context management, system integrations, agent runbooks and SOPs, eval sets, tooling. A legal AI company and a healthcare AI company end up building surprisingly similar underlying infrastructure because the problems converge: managing context, integrating with existing systems, maintaining agent operating procedures, evaluating output quality. The vertical differentiation dissolves. If that’s true, a horizontal approach that captures the convergence might be better positioned than being locked into one vertical racing against the model.
-
But there is a massive and growing delta between what the latest models can do and how organizations actually use them. 42% of enterprise AI initiatives discontinued in 2024, up from 17% the year before — more capability producing more failures, which is counterintuitive until you think about it. You can see the computer age everywhere except in the productivity statistics.
-
What separates real from performative: can you generate $1 of real EBITDA for your customers? A lot of companies show compelling demos and pitch decks. Fewer can point to concrete economic value that materialized because of what they built. The ones that can seem to have something in common: they went into the institution and did the work. Not from the screen. Sitting with operators, earning trust, watching people mark up printouts with highlighters. Does that pattern hold universally or just in the verticals I’ve been looking at?
The “every company becomes an API” thesis is insufficient
-
One framing of the future: capabilities are here, just automate everything. That framing captures something real but it’s incomplete. You need to actually deploy these things into organizations that were not designed for them. How much depth do you need for each domain? Is 80% from outside enough for most use cases, or does the last 20% where institutional reality lives matter disproportionately? I think it’s the latter in healthcare and government. Less sure about other verticals.
-
FDE teams make organizations machine-readable. Workflows made explicit, decision criteria and exception paths documented. Converting companies from human-social structures into something AI can execute against. The hospital that’s been made legible can participate in whatever comes next. The one that hasn’t is invisible to it.
-
But this conversion can’t happen from outside. A hospital’s 42-step billing process wasn’t designed by an idiot. It’s organizational scar tissue that encodes real institutional knowledge, built after something went wrong, probably multiple times. Each step is a lesson about a failure mode nobody documented any other way. The process IS the documentation. You have to understand why each step exists, which are load-bearing and which are vestigial, and then convert the load-bearing ones into forms an AI can execute and a human can verify.
The product is amorphous on purpose
-
I haven’t seen or used Mosaic, Lattice, or any of Percepta’s internal tooling. What follows is what I think winning looks like. My vision of what the product should be, not a description of what it is.
-
The way I think about the tooling: treat it all as a treasure hunter’s assets. Mosaic, Lattice, whatever comes next. These aren’t products you sell to clients. They’re how smart FDEs get leverage. Map how the institution actually operates (not the org chart version), connect systems fast, convert implicit processes into verifiable tasks, build eval sets that capture what “good” looks like for this specific institution.
-
For tasks that are already verifiable, you deploy AI, monitor, iterate. For tasks that aren’t, the work is breaking them down into chunks that become verifiable. Taking a messy process loaded with judgment and social context and converting it into discrete steps an AI can execute and a human can meaningfully confirm. That decomposition is Percepta’s core intellectual work. Can it be systematized into a platform, or does it always require senior judgment? That’s what determines if Percepta scales or stays boutique.
-
Whether that decomposition skill transfers across verticals is the open question. The industry has mostly converged on single-vertical integration (one company for legal, one for healthcare, one for insurance), which assumes what you learn making a hospital legible doesn’t help you make a government agency legible. I’m not sure that’s right. The specific domain knowledge doesn’t transfer, but the methodology might. Decomposing processes into verifiable tasks, building golden datasets of what “good” looks like, running reinforcement loops where operators improve the AI as they go. If that compounds across domains, Percepta’s multi-vertical approach is an advantage. If it doesn’t, spreading across verticals is dilution. The answer probably depends on whether Mosaic captures the methodology or just the domain-specific outputs.
-
The observability layer gets harder, not easier:
- Agent behavior needs to be legible to humans in real time.
- When multiple agents interact across a healthcare system, the state space explodes. You need tools to follow what happened, why, and whether the outcome was right.
- Human-in-the-loop is an architectural decision, not an afterthought. Design the entire system so human verification is meaningful and efficient, not a rubber stamp bolted on at the end.
-
Mosaic and whatever other tools Percepta creates are abstractions weaponized by FDEs. The goal is to get systems, processes, interactions, everything a company does operationally, into something legible by the model and verifiable by the model.
-
Deployment complexity gets WORSE with better models. Context velocity (how fast the AI system learns from resolved exceptions) is nearly asymptotic. Meaning velocity (how fast humans metabolize that into trust) is not. Approval chains, sign-offs, processes with legally fixed minimum durations. The gap between what AI can do and what institutions will allow widens with each capability jump. The FDE role evolves but doesn’t disappear. This could be wrong. It’s possible institutions adapt faster than I’m giving them credit for, or that some threshold of model capability makes the trust gap collapse. But the 42% and rising discontinuation rate suggests we’re not there yet.
-
Every playbook written to onboard employee #50 is a Mosaic feature in disguise. Every decision framework that lets a junior FDE act without calling Hirsh is actual IP. If the company treats scaling as overhead rather than product development, the most important IP gets thrown away. That conversion capability is the product, or at least it should be.
What I’d be watching
-
Whether the research lab connects to the deployment business or stays academic. Percepta’s specific advantage over Distyl and Brain Co: in-house research doing model-level work PLUS embedded teams generating curated domain data the labs can’t access. If those connect, that’s a flywheel nobody else can build.
- There’s growing evidence that models trained on curated, credentialed sources outperform general models in domains like healthcare. If that holds, the highest-value research isn’t general architecture but specialized models from curated clinical data.
- Without deployment data, the research is academic. Without proprietary research, the deployment is just consulting. I don’t know what the combination becomes but it’s clearly different from either one alone.
- If the lab’s published work has no production connection to deployments after six months, it’s window dressing.
-
Whether Mosaic is actually learning or just accumulating the feeling of learning. Playbooks get written, dashboards fill up. Does deployment six take less time than deployment one because of something in Mosaic, or because the team got experienced and the playbooks are just a record of that. If the experienced people leave, what does Mosaic actually contain?
-
The self-obsolescence problem. Successful deployments train the client to not need you. The better the team, the faster knowledge transfers. Palantir had Foundry creating platform lock-in. Percepta builds on a foundation model it doesn’t own. The client can hire their own engineers and point them at the same model. Each engagement has a ceiling unless Mosaic accumulates institutional intelligence faster than the client absorbs it.
-
The people question. This might matter more than any of the structural analysis above and I can’t evaluate it from outside.
-
Whether the structural thesis is right about the world but wrong about who captures the value. Deployment complexity persists. That doesn’t mean a startup owned by a holding company is the entity that captures it. I keep wanting to collapse “thesis is right” into “Percepta wins” and I don’t trust how good that feels.
-
The uncomfortable version of the GC relationship: Percepta does the work, GC’s assets get smarter, GC captures most of the upside. I don’t know how you think about that from inside.
One more thing I keep thinking about
-
Follow the premises far enough and you get somewhere uncomfortable. If intelligence commoditizes on the timeline I described, talent eventually stops being the main differentiator for company outcomes. Not immediately. Right now the specific people in the room are why Percepta works. But past a certain point it becomes more about hard power: capital, distribution, owned assets, institutional relationships, data captured over years in structured form.
-
GC’s balance sheet, the hospital, the Anthropic partnership — those are hard-power assets. Most AI companies are betting entirely on talent. Percepta has both, and the hard-power side is the part that survives commoditization. I find that reassuring and slightly unsettling at the same time, since I’m joining for the team. But it might be the strongest version of the positioning and I’m not sure anyone is framing it that way yet.