In data analysis, we often obsess over numerical data: "Retention is 30%," "LTV is $5." But numbers only tell you how much. To understand who your users are, you need categorical data. This guide explains how to leverage categorical attributes—like User Roles, Acquisition Sources, or Device Types—to perform the kind of deep segmentation that drives double-digit growth.
I. Back to Basics: What is Categorical Data in Mobile Apps?
Data comes in two main flavors. To use a sophisticated analytics tool effectively, you must understand the difference.
-
Numerical Data: Quantitative values that can be measured.
-
Examples: Session duration (seconds), Gold coins earned (amount), Level reached (number).
-
-
Categorical Data: Qualitative variables that describe characteristics or groups.
-
Examples:
-
User Role: "Warrior" vs. "Mage".
-
Acquisition Source: "Facebook" vs. "Organic".
-
Subscription Status: "Free" vs. "Premium".
-
Device Type: "iPhone 15" vs. "Samsung S24".
-
-
The Strategy: You cannot "average" categorical data (you can't have an "average user role"). Instead, you must use it to slice and dice your numerical metrics.
II. Why "Averages" Kill Product Strategy
If you analyze your app without splitting by categorical data, you are looking at a phantom.
The "Average" Fallacy:
-
Data: Your Event Analysis shows the average "Time to Complete Tutorial" is 10 minutes.
-
Conclusion: "The tutorial is pacing well."
-
The Reality (Split by Category):
-
Category A (RPG Veterans): Finish in 3 minutes (Bored).
-
Category B (Casual Players): Finish in 17 minutes (Frustrated).
-
By ignoring the category, you miss the opportunity to optimize for both groups. You need to segment them.
III. The SolarEngine Workflow: Turning Categories into Segments
SolarEngine is built to handle categorical data natively through its User Analysis and Distribution Analysis models. Here is how to apply this in practice.
Step 1: Define Your "Tags" (User Segments)
First, turn your categorical data into actionable groups. In SolarEngine, use the User Segments function to create Tags.
-
Scenario: You want to analyze high-value players.
-
Action: Create a Conditional Tag for users where
VIP_Level(Categorical Property) is greater than 0. -
Result: You now have a persistent segment called "VIP Users" that you can apply across all reports.
Step 2: Analyze Frequency with Distribution Analysis
Use the Distribution Analysis model to see how different categories perform specific actions.
-
Goal: Understand payment behavior.
-
Configuration: Set the event to "Purchase." SolarEngine will group users by frequency: "0 times," "1-3 times," or "5+ times".
-
Insight: You might find that users with the categorical attribute
Source: TikTokare highly represented in the "0 times" bucket, whileSource: Google Searchusers dominate the "5+ times" bucket.
Step 3: The "Crowd Comparison"
This is SolarEngine’s killer feature for categorical analysis. Use Users Analysis to run a Crowd Comparison.
-
Action: Select Group A (e.g., Category: "Tablet Users") and Group B (e.g., Category: "Phone Users").
-
Compare: Look at their attribute distributions side-by-side.
-
Discovery: You discover that "Tablet Users" have a 40% higher retention rate but a lower conversion rate.
-
Strategy: This categorical insight tells you to optimize the UI layout for tablets to fix the conversion bottleneck.
IV. Advanced Tip: Switching the "Analysis Entity"
Most tools force you to analyze categorical data only at the "User ID" level. SolarEngine allows you to switch the Analysis Entity.
-
Scenario: You are a B2B SaaS app, and you want to analyze behavior by "Company" (Account ID) rather than "Employee" (User ID).
-
Feature: Switch the entity to Account ID.
-
Result: You can now analyze categorical data like "Industry Type" or "Company Size" to see which types of companies are most likely to churn.
Categorical data is the lens through which you see your users clearly. It transforms a generic "User" into a "Warrior from Brazil who loves PvP." By using SolarEngine to map these categories into User Segments and Distribution Models, you move beyond vanity metrics and start building a product that resonates with every specific slice of your audience.
