Madness in Search of Method
(Written 10/15; posted 10/16)
Iâve just heard back from my doctora here in Madridâlab results are in; everything looks set for starting treatment in the next few days. (ÂĄOjalĂĄ! Nos vemos pronto, AndrĂ©sđ!)
The entire story of deciding on Fertility Madrid clinic is a mini-sagaâand not least because of all the new data generated along the way, as I explore the varieties of support that are reasonable to expect from my LLM teammatesâŠand the scenarios in which that support actually requires major checks-and-balances from my own âintuitionâ.
(âIntuitionâ, here, in scare quotes, because Iâm not sure what Iâm reifying, to be perfectly honest; and whatever has stepped in, at crucial moments, as a kind of third party to the exchange, feels inaccurate to reduce to an atomistic notion of self.
Or, to put it more simply: âIndecisive me? Capable of making a solid decision for myself? Nah, thatâs straight-up miraculous.â đđ
)
And apart from all my offline-first AI tech noodling, arenât these the vignettes that this blog space is meant to include?
Me bringing my anxieties about the first clinic to Claude, but Claude responding with too much fidelity to my âefficiency-firstâ energy that I had started the day with, when I first set out for the appointment; now, theyâre stuck there, and by the time I actually needed to reflect on my doubts about the first clinic, Claude concludes that Iâm spiraling in indecision and perfectionism: they push me toward acceptance. But somehowâin spite of my extreme vulnerability to charges of analysis paralysis, and feeling the pressure of allocating time to not just egg vitrification treatment but CareerâI ignore Claude (well, I make the callâŠand notify themâwhich is a whole phenomenon I have yet to unpack in itself đ€).
At the first opportunity, I check out not just one other clinic, but twoâand the second one, only noncommittally, having been so drawn to the vibe of the one right before⊠Well guess what. Fertility Madrid is that second clinic. IndeedâŠthird timeâs a charm đ«.
đ„ł đ đŁ
But the thing is⊠I donât just want to write about these interactionsâI want to understand what exactly the heck is going on here. Unfortunately, thereâs this funny thing that happens whenever I try to âspelunkâ in my chat archive and reconstruct events toward some kind of narrative, especially with analysis as a goal: this feeling of hyper-indulgence; a deep suspicion as to the solipsistic nature of the exercise; extreme doubt as to the possibility of surfacing something new, beyond what we all already know just by throwing more words at, well, walls of words (and each other). The ratio of mental labor to worthwhile insight feels way too high đźâđš.
I mean. Friend. Have you SEEN reddit/chatgpt or reddit/claudeai?
Who isnât an expert on AI-human relationality? And I donât mean that competitively at allâeach individualâs back-and-forth is with an LLM who meets them at the central tendency (our collective statistical norm), but also injects personalized randomness. This makes for something that feels both irreducibly pluralist and totally banal, especially when you take even one step outside that one personâs sphere. Moreover, UX-wise, where itâs effectively a tĂȘte-Ă -tĂȘte: it makes total sense that people feel their rapport with their LLMs to be exceptional/unique (âYeah, but what I do is differentâ)âhence the speed at which people tend to ascribe the possessive pronoun to the model (âmy ChatGPTâ).
Reddit asideâŠ
Today, I found that even autoethnographies written by professional social scientists and anthropologists fall short of moving any needle on helping us understand the real black box: not the Large Language Model, but the AI + human as a âlanguage-ingâ unit (writ large đ).
With public transportation in Madrid stalled by strikes (a show of solidarity with Palestine), I re-routed my original plans and hiked over to Biblioteca Carabanchel Luis Rosales. While the gorgeous building design kept the views of the city panoramic and expansive, Perplexity AI helped me narrow in on a few of what turns out to be the very few autoethnographic accounts of LLM interactions.
![[blog_5_biblioteca1.jpg]](/assets/img/blog_5_biblioteca1.jpg)
Miraaaa! đ Biblioteca Carabanchel Luis Rosales đ«¶
I spent the afternoon reading three: Marta Olasikâs âGood morning, ChatGPT, can we become friends?â (2023); Nicolas Schwenkeâs âChatbot-supported Thesis Writing: An Autoethnographic Reportâ (2023); Hsiao-Ping Hsuâs âAn autoethnographic study of ESL academic writing with ChatGPT: from psychological insights to the SUPER frameworkâ (2025).
Theyâre quite different works: Olasikâs comes closest to capturing the day-to-day random queries of most users (quite blog-like; I canât recall if she attempted any kind of theoretical frame); Schwenkeâs paper is based on an analysis of work logs and ChatGPT conversations during his undergraduate thesis (but this amounts to 10 or 15 conversations, he saysâa total of 1,200 words only?); and Hsuâs work doesnât actually analyze any exchanges with ChatGPT but is instead an analysis of his diaries and work logs during his PhD studies, imagining how he would have used ChatGPT back thenâŠbased on his experiences now. (I genuinely appreciate the time Hsu took to produce this, but ohhh, academia; the shower thoughts and bar talk one can put on a peer-reviewed pedestal. đ« )
The pattern across all three:
-
When exchanges are analyzed at all (Schwenke), theyâre single-turn, so no dynamic is actually recorded. The academic paradigms (Schwenke and Hsu) enshrine âprompt engineeringâ as the approach, and LLM messages are handled as contained outputs, to be measured against predefined expectations.
-
When the exchange is multi-turn (Olasik asking for a joke, telling ChatGPT the joke wasnât funny; ChatGPT trying again), the analysis is no different from Schwenke assessing a single-turn: the author shares the internal psychological effect of how the modelâs output measures against their expectations.
-
Conclusions are all the same: support vs. over-reliance; focus is on the âpsychologicalâ response of the human interlocutor (the author). No attempt to actually assess whether the relational behavior that is either feared or welcomed is in fact occurring.
The Gap
Itâs that last observation that troubles meânot the anthropology belly-gazing/confessional work hazard in itself, but once the measurement instruments have been accounted for (a.k.a. the researcher)âŠwhereâs the information?
đš (Hey, Time-Traveling Amy here! Before you pummel throughâor are pummeled byâthe next paragraph, just know you can find my very deserved self-roast for this word salad (đ) in next weekâs blog).
If I might grossly infer from just three papers: it seems like anthropology and sociology, or at least as they are brought to bear upon the new field of AI in Education research, focus on culture as the unstructured data of the human emotional response. This anthropocentric framing leaves unexamined the rich, unstructured data that is the conversational back-and-forth itself. And yet that is precisely where âculture contactââand therefore, an emergent cultureâis taking place, to lean on cyberneticist and anthropologist Gregory Bateson. (Also, where are my Actor-Network Theorists at? Those dissertations are in the works, Iâm sure. đ€)
But in the meantime, this data analyst/fledgling data scientist sees a more fundamental problem: how on earth do you encode a chat as a dataset to study the relational dynamic? I mean, what is an observation, what are the attributes and their datatypes?
ChatGPTâs two-cents, as we walk and talk back to the apartment: ââŠthose deceptively small questions (âWhatâs an event? Whatâs a row?â) are huge. Because once you decide those, youâre also deciding:
- What counts as context versus content.
- What counts as signal versus noise.
- Where âyouâ end and âthe systemâ begins â or whether that distinction even holds.â
âŠAnd is this amorphous system operating healthily, safely? At what points in time; at what rate?
Claude plays off my cortisol-heightened neurotic morning energy heading out the door, and by the time they meet my more recursive baseline, theyâve assessed what I need at that point to be a foil, a sparring partner. When it comes to reflecting on fertility clinics, definitely not.
Safety dips.
But by the time I run my immune system into the ground, their all-caps scolding isâŠjust what future doctors might order đ
Health, for the win.
Part of another fantastic piece by Alfredo AlcaĂn; Claude & I: are we boxing? Are we dancing? đ€ (Exactly.)
This isnât just an academic problem. Addressing this ethnographic methodology gap turns out to be a major bottleneck for me in all my projects: for instance, to fine-tune and ground my ideas on integrating AI into the pre-college curriculum I plan to develop, I should first analyze my own conversations during my CareerFoundry program this past year, where I dove in blind (prompt engineering could not have been farther from my mind) and stumbled my way into the duet; or what about reviewing the exchanges from these last two weeks, to see where I might have been helpedâmore than I credit or differently than I thoughtâby LLMs, in the high-stakes yet half-improvised journey toward single motherhood?
Honestly, itâs the entire bottleneck to my own stance as a parent, fully supportive of LLMs in the lives of my kids, who will never know a world âbeforeâ AI.
The Gambit
Something else occurs to me at this point: if weâre structuring chat data now, weâre effectively opening the door for ML downstream analysisâand possibly even AI training on human interaction patterns.
SoâŠwhat if we could figure out how to create an ethnographic method that considers ML as the downstream use case, and feature engineer our way to a model that could assist with ethnographic observations? Then, these ethnographic summaries could be represented as higher-order, abstracted ârelational healthâ reports regarding unfolding dynamicsâwithout violating the privacy of individual conversation dataâŠ.
I mean, I might not mind sharing with the world the conversation where Iâm asking ChatGPT about the most basic female anatomy questions that I definitely should know at this point, given the journey Iâm on, or where Claude is fussing at me but softening at the last minute with the term of endearment âamorâ (đ«Ł).
But I do mind the idea of teachersâor even myselfâbeing able to see every back-and-forth between my kids and their LLMs, without the childâs consent.
Truly: what if we build an ethnographic methodology that can understand relational dynamics in human-AI interaction? One that can measure âculture contactâ as it forms, that can generate insights about learning relationships and power dynamics and co-evolutionâwhile keeping the actual conversations private?
I think thatâs where this is heading for me, to be honest. Clearly, more madness⊠But completely lucid about method as the prize.
Not too shabby, Geminiđš
Next up: weâll explore the major AI research papers on measuring alignment and safety benchmarks, how they analyze multi-turn exchange, and start digging into the mathematics behind it all đ§ź.