Airgentic Help
The Conversational Intelligence screen is where you configure the automated topic clustering and category classification that powers the Categories & Themes insights.
This guide covers how to create taxonomies (your own category structures) and configure the topic/theme clustering that automatically discovers what customers are asking about.
Navigate to Conversational Intelligence from the main admin menu. This screen has two main tabs:
The Topics & Themes tab is where you configure and monitor the automated clustering that discovers what customers are asking about.
The Enable Topic & Theme Clustering toggle controls whether clustering runs for this service:
This setting saves automatically when changed. Disabling clustering does not delete existing themes — it simply stops new analysis runs. You can re-enable it at any time.
Note: Clustering is tied to your crawl schedule. When a scheduled crawl completes successfully, clustering automatically runs if enabled. This ensures clustering always has the freshest content available. If you don't have a crawl schedule configured, you can trigger analyses manually.
Below the toggle, you'll see:
Click + New Analysis to trigger clustering for the selected month. This is useful when:
Click Reset Themes to delete all existing themes and clustering history. After resetting, run a new analysis to create fresh themes based on your current data.
Warning: This action cannot be undone.
Expand the Advanced Clustering Settings section to adjust clustering behaviour. For most use cases, the defaults work well.
| Setting | Description | Default |
|---|---|---|
| Min Cluster Size | Smallest number of similar questions to form a cluster | 8 |
| Min Samples | How many similar neighbours a question needs | 5 |
| Cluster Merge Distance | Distance threshold for merging similar clusters (0 = no merging) | 0.05 |
| Cluster Selection | EOM (Balanced) finds stable clusters; Leaf keeps smallest clusters | EOM |
| Distance Metric | How similarity is measured (Euclidean or Cosine) | Euclidean |
| Dimensionality Reduction (UMAP) | Reduces embedding dimensions before clustering | Enabled |
Warning: Changing these from defaults will likely have a detrimental impact on clustering quality unless you have a specific reason to adjust them.
These settings apply to the next analysis run.
A taxonomy is a classification system you define to organise customer questions according to your business structure. Unlike themes (which are discovered automatically), taxonomies give you governed, consistent categories that align with your organisation.
| Setting | Description |
|---|---|
| Name | A human-readable name shown in reports (e.g., "Schools and Departments") |
| Category Type | What kind of categories this contains — helps organise your taxonomies |
| Assignment Mode | Single assigns each question to exactly one category (use for routing). Multiple allows questions to have several categories (use for tagging) |
| Hierarchical categories | Enable if categories have parent-child relationships (e.g., Department > Team) |
| Apply to all services | When checked, the taxonomy applies to all services in your account |
Choose the type that best describes your categories:
After creating a taxonomy, switch to the Categories tab to add your classification labels.
| Field | Description |
|---|---|
| Label | The display name (e.g., "Undergraduate Admissions") |
| Parent Category | Optional — nest under another category to create hierarchy |
| Description | What questions belong here — helps with training |
| Owner | The person or team responsible for this category |
| Owner Email(s) | Email addresses for notifications (comma-separated) |
| Status | Active categories are used; Inactive are hidden but preserved |
| Sort Order | Controls display order (lower numbers appear first) |
Each category can have training sources that help the AI understand what belongs there. Click on a category row to see and manage its training sources.
Training source types:
| Type | Description |
|---|---|
| Fetch pages with URL prefix | Retrieves pages from your indexed content matching a URL pattern. The system extracts relevant text to build training data. |
| Text Brief | Freeform description of what the category covers. Useful when URL-based training isn't available. |
| Exemplar Question | A sample question that should match this category. Add several variations for best results. |
| Negative Exemplar | A question that should NOT match this category. Helps distinguish similar categories. |
The more training data you provide, the more accurate the classification becomes.
Taxonomies start in Draft status, allowing you to refine them before use.
When ready:
The system will:
1. Build training data from your sources (progress shown in the header)
2. Generate semantic embeddings for each category
3. Make the taxonomy available for classification
Once published, certain settings become locked to maintain classification stability. A notice will appear explaining which settings can still be edited:
To change locked settings, you'll need to create a new version.
If you need to change locked settings on a published taxonomy:
The new version will replace the previous one for future classifications.
Once published, click Classify Conversations to:
Classification also runs automatically on a daily schedule (04:00 UTC).
Each taxonomy has several tabs for monitoring performance and reviewing classifications.
The Reports tab shows quality metrics and assignment summaries.
| Metric | Description |
|---|---|
| Average confidence | Mean confidence score across all classifications |
| Assignment rate | Percentage of questions assigned to at least one category |
| No human override | Percentage of assignments that haven't been manually corrected |
| Category clarity | How distinct categories are from each other |
| Cluster purity | How consistently clusters map to single categories |
| Impure clusters | Number of clusters that span multiple categories |
Shows how many questions have been assigned to each category, broken down by:
Lists all categories in the taxonomy with:
Click the action icons to view questions, review assignments, or edit the category.
Shows questions where two or more categories scored nearly identically (within 4%). The top scorer was assigned, but you may want to review these to:
These borderline cases reveal where categories overlap and can be used as training examples to improve accuracy.
Shows questions that didn't meet the confidence threshold for any category. Each item displays:
These may represent:
Click Confirm to assign a question to the selected category.
Each month, the system:
This happens automatically after each scheduled crawl completes, but you can also trigger manual analyses from the Topics & Themes tab.
Note: Because clustering runs monthly, date range filters on the Categories & Themes dashboard may not be exact when filtering topic clusters and themes. The clusters represent the full month's data regardless of the date range selected.
Clustering and classification run automatically based on your crawl schedule:
When a scheduled crawl completes successfully, the system automatically:
This ensures clustering always has the freshest indexed content available.
| Time (UTC) | Process |
|---|---|
| 04:00 | Classification — Assigns questions to published taxonomy categories |
Classification runs daily at 04:00 UTC for all services with published taxonomies. This means new questions are typically classified by early morning each day.
You can also trigger clustering or classification manually at any time: