
TL;DR
Graphon AI emerged from stealth with $8.3 million in seed funding to build a “pre-model intelligence layer” that uncovers relationships between multimodal enterprise data before arriving at a base model. The round was led by Novera Ventures, with participation from Perplexity Fund, Samsung Next, GS Futures, Hitachi Ventures and others. The company is named after a mathematical concept co-formalized by its technical advisors, UC Berkeley professors Jennifer Chayes and Christian Borgs. Founded by Arbaaz Khan (CEO), Deepak Mishra (COO) and Clark Zhang (CTO), with team members from Amazon, Meta, Google, Apple, NVIDIA and NASA. GS Group (South Korean conglomerate), one of its first customers, has implemented Graphon for convenience store analytics and construction site security.
The name is the indicator. Graphon AI, which emerged from sneaked in on Wednesday with $8.3 million in seed funding, It is named after a mathematical object that most people in AI have never heard of and that his two most prominent advisors helped invent. A graphon is the boundary of a sequence of dense graphs: a continuous function that captures the structure of relationships as networks grow infinitely. It’s the kind of concept that exists on the boundary between pure mathematics and theoretical computer science, and it’s now the basis of a startup that claims to have built the missing layer between business data and the models that are supposed to make sense of it.
The company’s thesis is simple, even if the mathematics behind it is not. Today’s large language models can process about a million tokens at a time. Companies hold billions of tokens in documents, videos, audio, images, records and databases. Augmented recovery generation, the current standard approach, can unearth relevant content from that mass, but it cannot uncover relationships between pieces of data that were never stored together. An LLM using RAG can answer a question about a specific document. You can’t reason about how that document connects to surveillance video, a compliance record, and a customer database, at least not without someone already mapping those connections.
The Graphon product is in front of the model, not inside it. Using graphon functions, a mathematical framework that extends the academic concept to a software layer, the system ingests multimodal data and automatically discovers relational structure across it, producing what the company calls persistent relational memory. The result, in theory, is a representation of an organization’s data that any base model or agent framework can query without being limited by its context window.
The people behind the mathematics.
The founding team consists of Arbaaz Khan as CEO, Deepak Mishra as COO and Clark Zhang as CTO. The company says its broader team includes former researchers and engineers from Amazon, Meta, Google, Apple, NVIDIA, Samsung AI Center, MIT, Rivian and NASA.
Perhaps most notable are the technical advisors. Jennifer Chayes, dean of the School of Computing, Data Science and Society at UC Berkeley, and Christian Borgs, professor of computer science at UC Berkeley, are listed as advisors. Borgs was part of the group of researchers, along with Chayes, László Lovász, Vera Sós and Katalin Vesztergombi, who formalized the graphon as a mathematical concept in 2008. In fact, the company is commercializing a framework that its advisors co-invented.
Chayes and Borgs described the approach in a joint statement as one that treats relational structure as a first-class element of the AI stack rather than something that can be inferred after the fact. The distinction is important because most current AI systems treat data as collections of individual elements that must be retrieved, not as networks of relationships that must be preserved.
An unusual investor table
The seed round was led by Arvind Gupta of Novera Ventures, who made his fund’s first investment in Graphon from its flagship vehicle. Gupta is best known as the founder of life sciences accelerator IndieBio, and his pivot to an AI infrastructure company suggests he sees a structural overlap between the problems Graphon addresses and the complex, multimodal data challenges that define scientific computing.
The rest of the cap table reads like a deliberate exercise in strategic diversity. Perplexity Fund, the $50 million venture arm of the AI search firm, participated along with Samsung Next, Hitachi Ventures, GS Futures (the venture arm of South Korean conglomerate GS Group), Gaia Ventures, B37 Ventures and Aurum Partners, the investment fund affiliated with the ownership group of the San Francisco 49ers.
The mix is revealing. An AI search company, a consumer electronics giant, a Japanese industrial conglomerate, and a Korean chaebol investing in the same pre-model data layer suggest that the context window problem that Graphon aims to solve is felt across industries that would otherwise have little in common. GS Group, which is among South Korea’s largest conglomerates with interests spanning energy, retail and construction, is also an early customer. Ally Kim, vice president of GS, said the company’s multimodal AI solutions have been applied to analyze customer movement in convenience stores and improve security through CCTV analysis at construction sites.
The technical bet
Graphon’s positioning reflects a broader shift in the AI infrastructure market. The last three years have been dominated by a race to build larger models with longer context windows. But even the most capable models still hit a ceiling: They can process more tokens, but they can’t maintain relational awareness across the volumes of data that large organizations generate. The question Graphon is betting on is whether the solution lies not in expanding the context window even further, but in structuring the data before it enters the window.
The company says it has already deployed its platform for enterprise content management, industrial intelligence, agent workflows and device applications across phones, cameras, wearables and smart glasses. The breadth of claimed use cases is ambitious for an early-stage company, and the lack of independent benchmarks or detailed customer case studies beyond GS Group makes it difficult to assess how far the technology has progressed from concept to production.
What is clear is that the problem Graphon describes is real. The gap between what LLMs can theoretically do and what they can actually do with enterprise data remains one of the most significant limitations to AI implementation. Augmented recall generation has been the industry’s primary response, and its limitations—flat recall that ignores relationships between data sets and context windows that impose artificial limits on what the model can see—are well documented. Whether graph functions offer a fundamentally better approach or simply a theoretically more elegant version of graph-based data structuring is the question the enterprise will need to answer as it moves from stealth-mode math to production-grade infrastructure.
The 8.3 million dollars give him a clue to try it. The advisors who co-invented the underlying mathematics give it credibility. But in an AI market with no shortage of startups claiming to have found the missing layer, Graphon’s challenge will be to demonstrate that the math for which it is named translates into a measurable improvement in the way basic models handle the messy, multimodal reality of enterprise data, not just in theory, but at the scale at which theory is no longer sufficient.





