I Built an AI to Read Reddit
The content engine - tech meets health
I built an AI system that reads thousands of Reddit posts, TikTok videos, and Instagram comments to find the health questions nobody is answering.
It's open source. Here's how it works. 🧵
The problem: there are millions of people asking real health questions in comment sections and forums.
"Is this safe with my medication?" "Why did this work for everyone but me?" "My doctor says I'm fine but I feel terrible."
Most of these go unanswered. Or answered badly.
So I built a pipeline:
Step 1: Collectors pull data from Reddit, TikTok, Instagram - top posts, comments, questions.
Step 2: AI classifies each one - topic, theme, whether it's a question, what the underlying concern is.
Step 3: Clusters similar questions across platforms.
Step 4: Ranks by engagement + how UNANSWERED the question is.
The output: a ranked queue of content ideas driven by what real people actually want to know.
Not what I think is interesting. Not what gets the most likes. What has the biggest GAP between demand and supply.
Example findings:
- Questions about supplement interactions with medications = massively underserved
- "Is X safe long-term?" = the #1 question pattern across every platform
- Genetic testing confusion = huge demand, almost no good answers
- Cost/accessibility concerns = barely addressed by any creator in this space
The whole thing is on GitHub. TypeScript, SQLite, Claude for the AI layer.
If you make content - health or otherwise - fork it. Point it at your niche. Let the data tell you what to make instead of guessing.
github.com/wkasel/content-engine
This is what I put in The Manual every week.
Subscribe to The Manual →