Session transcripts as one big HTML file

KJS · Mar 3, 2025

Pecha said:
I've updated the link on Cassiopaean Session Transcripts Search to link to the GitHub page you posted here, is that alright?

I was thinking about rewriting book assembly, but using Transcripts Search as the only source of material. Scraping XenForo was an error-prone experience, resulting in some sessions not being retrieved or being badly formatted in the initial stage. It should be much easier for me to just write a script that uses the Transcripts Search API.

msasa79 · Mar 3, 2025

goyacobol said:
Just saw @artofdream's find. I think that is probably the best reference too. Thanks @artofdream.

Yeah, agree, and thanks for your offer without which I wouldn't have asked for help. Networking!!! :thup:

By the way, reading the whole session in question, noticed that it's one of those sessions where only Frank was channelling, so probably the exchange there would need a bit of extra salt as discernment when assessing what was conveyed. FWIW.

Pecha · Mar 5, 2025

KJS said:
I was thinking about rewriting book assembly, but using Transcripts Search as the only source of material. Scraping XenForo was an error-prone experience, resulting in some sessions not being retrieved or being badly formatted in the initial stage. It should be much easier for me to just write a script that uses the Transcripts Search API.

I'd be happy to collaborate with you on this and show you how the API works plus how the data is structured. If you'd like, we can start a message chat.

Right now, the English and French versions are the most complete out of all the languages in the API. The current translated Spanish transcripts are the next ones to add.

KJS · Mar 9, 2025

Pecha said:
I'd be happy to collaborate with you on this and show you how the API works plus how the data is structured. If you'd like, we can start a message chat.

I have just started a new job, so I do not have much free time at the moment, but I will contact you as soon as my work situation loosens up a little. Apart from books, I have recently started using LLMs more seriously, and I am blown away by how good they are at tasks involving language analysis. I am waiting for grok3 to become available via API, and then we can experiment with analyzing transcripts on a different level. For example, in the style of Clif High, we can ask the LLM to list words that do not belong to the context or are unusual in the context in Cs answers. With 128k token context windows, we can easily go through transcripts year by year (quite possibly even decade by decade).

romseguy · May 2, 2025

Hello, if this has been posted elsewhere feel free to delete, the wave in one HTML file that I like to use as a convenient way for looking up keywords :

https://files.lebonforum.fr/wave.html

Session transcripts as one big HTML file

KJS

The Living Force

msasa79

Jedi Master

Pecha

Jedi Council Member

KJS

The Living Force

romseguy

The Force is Strong With This One

Trending content