Ken DengReading through hundreds of Discord messages after a playtest feels like panning for gold in a river...
Reading through hundreds of Discord messages after a playtest feels like panning for gold in a river of mud. You know valuable insights are there, but your time is limited and your signal‑to‑noise ratio is terrible. AI automation can turn that flood of feedback into a clean, prioritized list of what to fix and what to build next.
The key principle is categorical separation: you must first decide what you’re looking for. From my work with dozens of indie teams, the most effective framework starts with just two clear categories:
When you define these categories with your own game‑specific examples, you give AI a target. Without this framework, LLMs will lump “the dragon boss is unfair” with “add a fishing minigame” into the same “feedback” bucket.
You don’t need to build a custom NLP pipeline. A tool like GPT‑4 (or any modern large language model) can classify comments with high consistency when given a short prompt that includes your category definitions and a few of your own examples. Its purpose: to read 10,000 comments in minutes, tagging each as “feature request,” “balance issue,” or “other.” That’s the scaling advantage you can’t replicate manually.
A player types: “The Frost Staff is useless compared to the Fireball.” The AI sees “useless” and “compared to” – classic comparative‑power language – and flags it as a balance issue. Another player writes: “Can we have a map for the forest dungeon?” The AI detects “Can we have” (feature‑request syntax) and routes it to the feature request pile.
Define your categories with real examples. Write 5‑10 gold‑standard examples of feature requests and balance issues drawn from your game (e.g., “Frost Staff vs Fireball” for balance, “co‑op mode” for feature). Feed these into your AI prompt.
Centralize your feedback. Export Discord logs, forum threads, and survey responses to a plain‑text file or a simple database. The more data you include, the better the AI can detect patterns.
Run a weekly classification job. Use a script to send batches of new comments to the AI, store the results (comment text + category + confidence), and generate a ranked report. Focus first on balance issues repeated by 10+ players — those are friction points, not noise.