discovering the postrat canon in the community archive | lab notes #4
canonical stories!
The work referred to in this post can be found here.
Figure: Mini map of the memetic landscape - one point per strand
Last time, Alexandre and I wrote about how we derive strands of narrative from single tweets.
Then we looked in detail at a single strand we got from Rich’s great diagram of hosting as increasing agency.
Okay, we got a strand. Now get EVERY STRAND. Or rather, every strand we can find easily. We know quote tweets are a great metric so we take the top 100 tweets that were most quoted by community archive users. This technique gives us for free the collection of quote tweets and their threads. They form a mini-corpus commenting on the original tweet, refining it, critiquing it or referencing it in new contexts.
To complement this corpus with content outside of quote tweets we used semantic search to find tweets related to the root. This way, we can find what came before the root so we can reconstruct the (partial) story setting the stage for the banger. Our qualitative estimate is that ~ 70% of the information value comes from quote tweets, and 30% for semantic matches, concentrated on semantic matches before the root.
These are the raw materials for our strands, the basis of our automated analysis.
Our main priority was to curate the data to find the best strands, and within a strand the tweets that are the core of its narrative. We had an LLM interview us about what makes a set of tweets a good strand of narrative. Among other things, we care about
Cohesion: The tweets are actually about the same thing or relevant to the same story.
Evolution: The featured idea evolved, or story state changed.
Utility: the story features ideas or projects that are useful, either to the community or to the world at large - like scientific knowledge, art, tools, infrastructure, etc.
Figure: Our prompt
We manually analyzed and narrativized 3 strands as few-shot examples.
Figure: Our manual analysis of MasterTimBlais’ tpot norms strand!
Then we had Claude Sonnet 4.5 go through each strand, rate them according to our criteria, summarize them, and highlight 10 essential tweets as narrative touch points for each.
results
The top strands are exactly what we would expect them to be. Visa’s domino meme on catalyzing a golden age has a 9/10. You could argue it’s the main teleological meme in tpot, so it’s looking good.
Another one is around Visa describing the threading game, which you can argue is the characteristic mode of posting of this scene.
Another is “focus your time and energy on what you want to see more of”, conveying the (very postrat) skill of hyperstition.
18 of the top 35 strands above 8/10 score in the list are Visa’s. Part of me wonders if there’s some implementation weirdness at play, but this list isn’t ranked by number of QTs, strands are rated by Claude. Visa is just prolific and central to the scene.
I would also note that we are missing accounts I would consider important in the scene, like vgr, nickcammaratta, aella_girl, tszzl, liminal_warmth, michelcurzi, deepfates, selentelechia.
I’m really excited about this collection! When my friend Brent pushed me to start the community archive, our main desire was to preserve and see the “tpot canon”: the main ideas and stories and projects that have come out of our scene over the past 5 years.
quality control
Now we have it and there’s a decent amount of work in just going through all of them and gauging quality. I think strand summaries tend to be pretty good, but not always flawless. E.g. I would say Christine’s guide to tpot is a significant artifact but it gets a score of 4. This is an effect of i) our rating system that assess the quality of the discourse in the strand, not the artefact itself (we also don’t read link content) and ii) the way we gather strand data, as Christine’s guide is part of a bigger story about TPOT definition and guides that didn’t show up in quotes or semantic matches.
I also see some tweets highlighted sometimes that aren’t especially connected. Like this one with a culture war tone from someone who shares only 17 mutuals with me.
I’d say this happens because we’re factoring in tweets that are streamed live from people who use the community archive extension. Right now, we’re getting most tweets that users see. It could be a good time to filter only for users who’ve opted-in to being streamed. It would give us a denser higher signal core. Long-term with adoption, the strategy of relying on the self-selection of users for quality will stop working and we’ll want to do some community-finding to filter our analysis.
Figure: Essential tweets timeline, with explanations of what’s relevant about each plot point!
Figure: If you click “Explore seeds” in the corner you’ll get to see all the underlying data in the seed rather than just the essentials. It may take a minute to load.
rough themes among strands
I’m excited to see if we can weave different strands together into a cohesive tapestry. Could be cool to just get all the summaries and important tweets and throw them into an LLM.
On a first, non-exhaustive pass, I’m surprised I don’t see a Jhana strand, but we see some amount of
dharma / inner work :
community building and scenius:
interpreting magic and spirituality and woo through materialist lenses:
the eigenprompt
the Community Archive and twitter tools!
scene-steering
on illegibility
A big value in old school tpot was illegibility. I haven’t stopped contending with the fact that the community archive and my research increase legibility. They make it easier to describe their subjects. I am fascinated with my online intellectual home and that leads me to want to understand it. The question of whether it’s good for it to be understood in public is still out.
My sense is that the internet is already a dark forest and tweets are already public, so anyone with sufficient resources and willingness could do it. However, I am doing it. I think there’s an argument that the scene should be strong to be ok in the public eye, and that understanding itself and its main ideas lets it tell its own story better which increases self-determination.
The original argument for illegibility was that the scene was young and needed to incubate but I think we’re far past that point. If anything, there’s a sense of stuckness that I hypothesize comes from having too much of a backlog of unintegrated insights, and this work is part of metabolizing them. Separating the wheat from the chaff, distillation that facilitates digestion. Literally a digestif.
I’ve personally gotten a lot of value from existing in this part of twitter - friends, partners, co-founders, worldview updates, practices, funding - and believe that lots of juice has been produced here. There are many sub-cultures and ways of articulating the main value generated here. I think there are 2 things:
Tpot as a developmental container - people move through tpot, find their people, raise one another up, become more powerful, do projects, etc.
Postrats are uniquely positioned to unify the right and left brain - yin with yang - subjective and objective, spirituality and science. The most clearly useful version of this is pragmatic dharma and spiritual practices articulated in materialist terms, often with computer science metaphors. This very obviously potentiates the developmental container because it makes emotional tools accessible to smart and ambitious people that wouldn’t have them otherwise.
Concluding
It’s still unclear to me how this directory will be valuable to others, but I’m feeling really good about my work so far. I realized 60% of my desire of seeing the postrat canon as a whole, and I feel like I’m fulfilling the work I described in my funding proposal: We have embedding infrastructure, we’re testing our tools on past narratives, we’re iterating with the community by posting these lab notes.
I think letting the scene write its own story while resourced by a bird’s eye view of its discourse over the years is a pretty interesting idea. Will this help (some of) us chart a course? Reaffirm our enthusiasm and confidence that there are things to realize?
An interesting question is: what about real-time and the future? We rely pretty heavily on quote-tweets for these strands, and the closer you are to the present, the less meaningful QTs are, because they’re especially meaningful when they happen far away from original posting time, since they signify the OP is worth referring to. I haven’t given this much thought but I can see a world where we develop proxy metrics for QTs, or simply develop higher time-resolution methods for spotting promising emerging memes. No replacement for the real human taste that is leveraged in QTs, so we may want to involve more humans in the process somehow.
Again, I’m really happy with our work. Excited for you to read and explore the strands! Check it out here.
We become more powerful as we get more data so please install the CA stream extension to stream tweets you see to the archive. Consider also requesting your twitter archive and uploading to the Community Archive.
PS: I’m raising for a 10x more ambitious project so if you work with or close to organizations like Cosmos, ARIA, Astera, Renaissance Philanthropy, I’d love to talk to you.






























newsletter link to the app was wrong, here's the right one: https://bangers.community-archive.org/best-strands