Session transcripts as one big HTML file

I've updated the link on Cassiopaean Session Transcripts Search to link to the GitHub page you posted here, is that alright?
I was thinking about rewriting book assembly, but using Transcripts Search as the only source of material. Scraping XenForo was an error-prone experience, resulting in some sessions not being retrieved or being badly formatted in the initial stage. It should be much easier for me to just write a script that uses the Transcripts Search API.
 
Just saw @artofdream's find. I think that is probably the best reference too. Thanks @artofdream.
Yeah, agree, and thanks for your offer without which I wouldn't have asked for help. Networking!!! :thup::rockon:

By the way, reading the whole session in question, noticed that it's one of those sessions where only Frank was channelling, so probably the exchange there would need a bit of extra salt as discernment when assessing what was conveyed. FWIW.
 
I was thinking about rewriting book assembly, but using Transcripts Search as the only source of material. Scraping XenForo was an error-prone experience, resulting in some sessions not being retrieved or being badly formatted in the initial stage. It should be much easier for me to just write a script that uses the Transcripts Search API.

I'd be happy to collaborate with you on this and show you how the API works plus how the data is structured. If you'd like, we can start a message chat.

Right now, the English and French versions are the most complete out of all the languages in the API. The current translated Spanish transcripts are the next ones to add.
 
I'd be happy to collaborate with you on this and show you how the API works plus how the data is structured. If you'd like, we can start a message chat.
I have just started a new job, so I do not have much free time at the moment, but I will contact you as soon as my work situation loosens up a little. Apart from books, I have recently started using LLMs more seriously, and I am blown away by how good they are at tasks involving language analysis. I am waiting for grok3 to become available via API, and then we can experiment with analyzing transcripts on a different level. For example, in the style of Clif High, we can ask the LLM to list words that do not belong to the context or are unusual in the context in Cs answers. With 128k token context windows, we can easily go through transcripts year by year (quite possibly even decade by decade).
 
Back
Top Bottom