Audience – Pontiac Wiki

Audience Overview

The Audience model uses demographic regression and persona clustering to understand who is most likely to convert. It identifies which census-based features are predictive of performance and groups users into distinct audience segments. All modeling is based on aggregated, privacy-safe data at the ZIP code level, with no use of cookies or personally identifiable information (PII).

The Dashboard tab highlights which census features predict conversions, along with scored ZIP codes and lookalike expansion targets. The Personas tab segments audiences into clusters with shared characteristics and geographic distribution. These insights are currently available for the United States only, leveraging standardized data from the United States Census Bureau to ensure consistency and accuracy.

Use Cases: Understand, Activate, Scale

Understand Your Audience: Define who your customers are and what drives them to convert.
- Understand Your Audience Before You Launch (Audience Mode)
  - Use site visitor data to uncover who your audience is, where they’re located, and what defines them—so you can build smarter strategy before spending media dollars.
- Find Where Your Best Customers Are (ZIP List Mode)
  - Identify high-performing ZIP codes weighted by revenue or LTV to understand what makes them valuable—so you can prioritize the geographies that matter most.
- Turn Data into Clear, Actionable Audiences (All Modes)
  - Segment users into distinct personas with shared traits and geographic patterns—so you can translate data into clear targeting and messaging.
Activate with Confidence: Turn insights into smarter targeting and campaign execution.
- Plan Smarter Campaigns with Data-Backed Insights
  - Use demographic and geographic signals to guide targeting, messaging, and channel strategy—so every campaign is built on what actually drives performance.
- Connect Personas to Performance (Campaign / ZIP List Modes)
  - Link high-performing ZIP codes to persona segments—so you can activate and optimize at the audience level, not just the campaign level.
Scale What Works: Grow efficiently while maintaining performance.
- Measure, Learn, and Optimize in Real Time (Campaign Mode)
  - Analyze past and live campaigns to identify what’s driving conversions—then optimize targeting and spend while campaigns are running.
- Scale What Works with Lookalike Expansion (Campaign / ZIP List Modes)
  - Use top-performing audiences or geographies to find similar, high-potential segments—so you can expand reach without sacrificing efficiency.

New Report Set Up

Choose how you want to analyze your audience, whether through campaign performance, site visitors, or custom ZIPs, then configure dates, optimization type, and inputs to generate the report results and model.

Instructions for generating a Audience Report and Model. Follow the steps for initial setup below:

Navigate to the Audience Tab.
Click New Report button.
Give report a name.
Select the Advertiser to run the analysis on.
Select the Analysis Mode:
- Campaign – analyze ad performance based on campaign delivery.
  - Requires campaigns and lines with media served.
  - Identifies which audiences and demographics drive conversions.
- Audience – profile site visitors using pixel data only.
  - Requires pixel fires on the advertiser’s site.
  - No campaign or line selection required.
  - Tip: Can be used for pre-campaign analysis to understand your audience before launch.
- ZIP List – profile a custom set of ZIP codes with weights.
  - Model learns which demographics correlate with higher-weighted ZIPs.
  - Tip: Useful for profiling ZIPs based on sales, revenue, or LTV.
Choose whether the report should run once or repeat on a schedule.
- Run once builds a single report.
- Schedule re-runs it on the chosen cadence — each run retrains the model, replaces the live model in S3 when Replace the live model each run is enabled, and is saved as a new run grouped under this report.
  - Rerun Interval:
    - Weekly
    - Every 2 Weeks
    - Monthly
  - Rerun Day of Week:
    - Monday
    - Tuesday
    - Wednesday
    - Thursday
    - Friday
    - Saturday
    - Sunday
  - Replace the live model each run checkbox
    - On: each run’s new model overwrites the live model so it goes live automatically.
    - Off: runs still produce reports and archived models, but the live model is left untouched.
Click Save Report button.

Campaign Mode Example

After selecting Campaign Mode, complete the following:

Select the Campaign(s) within the chosen Advertiser.
- If no campaigns are selected, all campaigns will be included.
Select the Line(s) within the chosen Campaign(s).
- If no lines are selected, all lines will be included.
Optimization Type:
- Conversions: Choose the conversion pixel to be used for optimization.
Select the analysis Start Date.
Select the analysis End Date.
Enter the desired lookback window (in days).
- Default is 30 days.
- This defines how far back the model will attribute conversions or clicks to ad exposure.
Click the Save Report button.

Audience Mode Example

After selecting Audience Mode, complete the following:

Select the Optimization Type:
- Conversions
  - If selected, choose the conversion pixel to be used for optimization.
- Clicks
Select the analysis Start Date.
Select the analysis End Date.
Enter the desired lookback window (in days).
- Default is 30 days.
- This defines how far back the model will attribute conversions or clicks to ad exposure.
Click the Save Report button.

ZIP List Mode

After selecting ZIP List Mode, completed the following:

Enter ZIP codes with weights.
- One per line or comma-separated. Format: ZIP:weight (weight defaults to 1 if omitted).
- Example input: 11215:2, 33138:8
Click the Save Report button.

Report Results

The report results include the pipeline used, date created, when the report started, and when the report completed. The output is separated into 7 sections:

Summary
Dashboard
Lookalike
Personas
Explorer
Charts
Downloads

This information provides a complete audit trail and transparency into the model, allowing users to understand how insights are generated and tie them back to audience behavior and performance, rather than relying on a black-box approach.

Summary

The Summary section provides a high-level overview of the analysis, including the Report ID, Advertiser ID, and Date Range, along with an automatically generated Executive Summary.

This summary explains:

Key demographic drivers of conversions
Audience composition and persona breakdown
Geographic performance and distribution
Lookalike expansion opportunities (if applicable)
Strategic recommendations and action items

Tip: You can input the Executive Summary into your preferred LLM (e.g., ChatGPT or Claude) to quickly generate a presentation deck or case study based on the results.

Dashboard

The Dashboard section provides a visual overview of audience composition and demographic drivers of performance. It highlights which census-based features influence conversions, how audiences are distributed geographically, and how different demographic segments perform.

Dashboard Sections:

Tile Metrics
Most Important Demographic Categories
Feature Impact Distribution — SHAP Beeswarm
Demographic Sub-Feature Impact
- Shows the top three most important features based on your audience dynamically.
Geographic Performance — Click a State to See ZIP Codes
Cost per Conversion by Demographic Category

Tile Metrics

Analysis Model: Campaign Mode — results are based on ad-attributed conversions. Impressions served by your lines that led to pixel fires within the lookback window. The model learns which demographics respond to your ad.
Optimization: Conversions or Clicks
- Conversions: Optimizing for conversions (pixel fires). The model predicts which demographics drive conversion events.
R² (Coefficient of Determination): Measures how much of the conversion rate variance is explained by demographics. 5–15% is typical. Demographics are just one signal among many.
RMSE (Root Mean Squared Error): Lower is better. Measures the average prediction error in the same units as the target (conversion rate).
MAE (Mean Absolute Error): Average absolute difference between predicted and actual conversion rates. Less sensitive to outliers than RMSE.
Scored ZIPs: Total ZIP codes scored with demographic affinity. Example: 1,850 positive, 25,483 negative or zero (this figure changes dynamically).
Total Spend: Total ad spend across all campaign ZIPs in this analysis period.
Avg Cost/Conv: Average cost per conversion across all campaign ZIPs. Lower is more efficient.
Samples: Number of ZIP codes included in the analysis after filtering for (>300 events) and outlier removal.
Features: Number of census demographic features used by the model.
Reliability: Pipeline quality assessment. Passed 3/4 checks — the results change dynamically.

Most Important Demographic Categories

This section shows the total SHAP importance aggregated by demographic category, helping identify which broad demographic dimensions have the greatest influence on the model’s conversion rate predictions. Use this to understand which broad demographic dimensions matter the most.

Chart View

What It Shows:

Each bar represents a demographic category (e.g., Household Income, Age, Housing, Transportation)
Bar height reflects total SHAP importance across all features within that category

Interpretation:

Taller bars indicate categories that have a greater impact on the model’s predictions
Categories at the top represent the strongest drivers of conversion likelihood
Lower bars indicate categories with less influence on performance

Table View

Provides a detailed breakdown of individual features within each demographic category, including:

Category: High-level demographic group
Feature: Specific sub-feature (e.g., income bracket, age range)
Mean Abs SHAP: Overall importance of the feature
Mean SHAP: Direction of impact (positive or negative influence)

Users can download this table as a CSV file for further analysis.

Feature Impact Distribution — SHAP Beeswarm

This section shows how each demographic feature impacts conversion predictions across all ZIP codes.

What It Shows:

Each dot represents a single ZIP code prediction
X-axis shows SHAP value (impact on predicted conversion rate)
Color represents the feature value:
- Red = higher values
- Blue = lower values
Features are ordered by importance (top = most important)

Interpretation:

Points to the right increase predicted conversion rate
Points to the left decrease predicted conversion rate
Color helps identify whether higher or lower values drive performance
Clusters indicate consistent impact patterns across ZIP codes
Wide spread shows variable impact, while tight clusters show consistent behavior

Example Interpretation: This audience skews toward non-married individuals and people aged 30 to 40, who are more likely to convert, while higher-density housing areas tend to underperform.

Households Marital Status: Never Married
- Takeaway: Prioritize ZIPs with higher concentrations of never-married populations as this is a strong positive signal for conversion.
  - Many points extend to the right, indicating this feature often increases predicted conversion rate
  - Higher values (red) are more concentrated on the right, showing that higher concentrations of never-married populations drive performance
  - The distribution is relatively wide, meaning the impact varies across ZIP codes
  - Visible clustering suggests consistent positive patterns in certain regions
Housing Type 10 to 19
- Takeaway: Deprioritize areas with high concentrations of this housing type (mid- to high-density housing such as apartments or condos). Lower concentrations tend to perform more consistently.
  - Points are distributed on both sides of zero, indicating this feature can both increase and decrease predicted conversion rate
  - Red values (higher concentrations) extend more to the left, showing that higher values tend to decrease conversion likelihood
  - Blue values (lower concentrations) are more clustered on the right, indicating lower values tend to increase conversion likelihood
  - The spread is moderately wide, suggesting variable impact across ZIP codes
  - Clusters near the center indicate more neutral or mixed performance overall
Age 30 to 40
- Takeaway: This is a stable, reliable positive signal and a good baseline demographic to include in targeting.
  - Points lean slightly to the right, indicating a generally positive impact on conversion rate
  - Higher values (red) tend to appear more on the right, suggesting higher concentrations in this age group improve performance
  - The distribution is more tightly clustered, indicating a more consistent effect across ZIP codes
  - Less spread means this feature behaves more predictably compared to others

Demographic Sub-Feature Impact

This section breaks down a top demographic category into its individual sub-features to show how each one influences conversion predictions.

Household Income Example

This chart is a SHAP Beeswarm for the Household Income category.

What It Shows:

Each dot represents a ZIP code prediction
Each row represents a specific sub-feature
X-axis shows SHAP value (impact on predicted conversion rate)
Color represents the feature value:
- Red = higher values
- Blue = lower values
Features are ordered by importance (top = most important)

Interpretation:

Points to the right increase predicted conversion rate
Points to the left decrease predicted conversion rate
Color helps identify whether higher or lower values drive performance
Clusters indicate consistent impact patterns across ZIP codes
Wide spread shows variable impact, while tight clusters show consistent behavior

Example Interpretation: This audience skews toward low to mid-income households, which are more likely to convert than higher-income segments.

Household Income $125,000 to $149,999
- Takeaway: Higher concentrations of this income group tend to decrease conversion likelihood, while lower presence performs better.
  - Red points (higher values) are concentrated left of zero, indicating a negative impact
  - Blue points extend more to the right, showing lower concentrations improve performance
  - Wide spread indicates variable impact across ZIP codes
Household Income $25,000 to $29,999
- Takeaway: This income bracket is a strong positive signal when present at higher levels.
  - Red points cluster to the right, indicating higher values increase predicted conversion rate
  - Blue points appear more on the left, showing lower values reduce performance
  - Moderate spread suggests some variability, but generally consistent direction
Household Income $50,000 to $59,999
- Takeaway: A strong and reliable positive segment worth prioritizing.
  - Red points are clearly right-skewed, showing higher concentrations drive conversions
  - Blue points are more left or neutral, indicating weaker performance when absent
  - Wider spread shows impact varies, but direction is consistently positive

Geographic Performance — Click a State to See ZIP Codes

This section provides an interactive map of conversion performance by geography, helping identify where your audience is performing best.

What It Shows:

States and ZIP codes colored by conversion rate
Visual distribution of performance across regions and local markets
Ability to analyze performance at both state and ZIP-level
Toggle between Heatmap and Boundaries view:
- Boundaries: Clearly outlines geographic regions
- Heatmap: Highlights performance intensity across regions

Boundaries View

Heatmap View

Interactions:

Hover over a state to view:
- State Name
- Conversion Rate
- Impressions
- Attributed Impressions
- Unique Conversions
- Unique Users
Click a state to drill down into ZIP level performance – Zip code view:
- Zip code
- Zip code name
- Conversions
- Impressions
- Attributed Impressions
- Unique Conversions
- Unique Users
- Predicted Rate

Zip Code View

Table View

This table contains the underlying data used to power the Geographic Performance map, providing detailed metrics at the ZIP code level. Users can download this table as a CSV file for further analysis.

Each row represents a ZIP code and its associated performance, which is aggregated to render state-level views in the map.

What It Includes:

Geographic identifiers
- Geo Zip
- Geo Region Name
- City
- State ID
Delivery Metrics
- Impressions: Total number of times ads were served
- Attributed Impressions: Impressions tied to users who later converted within the lookback window
- Unique Users: Number of distinct users exposed to ads
Performance Metrics
- Conversions: Total number of conversion events attributed to the campaign
- Conversion Rate: Attributed Impressions / Impressions
- Attributed Conversion Rate: Conversions / Impressions

Example insight:

07103 (NJ) shows a Conversion Rate (2.20%) and Attributed Conversion Rate of 0.30%, indicating performance well above average (0.077% conversion rate from Executive Summary for this data).

Cost per Conversion by Demographic Category

This section shows how cost efficiency changes based on how strongly a demographic category is represented in a ZIP, helping identify which types of areas deliver the best ROI.

What It Shows:

Each demographic category (e.g., Income, Age, Marital Status) is broken into quartiles based on its most important sub-feature
Each bar represents a quartile segment (Low to High)
Bar height represents cost per conversion (CPA)

Interpretation:

Lower bars = cheaper conversions (more efficient)
Higher bars = more expensive conversions (less efficient)
Compare across quartiles to see how performance changes as the concentration of the demographic increases or decreases within a ZIP
Differences across categories show which demographic dimensions drive efficiency vs. cost

Example Interpretation: Balance scale and efficiency by prioritizing ZIPs with strong marital status and age signals, while reducing spend in areas where income signals increase CPA.

Households Marital Status
- Takeaway: ZIPs where this category has a stronger presence are more cost-efficient, while weaker presence is expensive.
  - The Low quartile has the highest CPA ~$4.05, indicating poor efficiency
  - CPA decreases steadily across quartiles, with High segments being much cheaper to ~$1.21
  - Indicates areas where marital status signals are stronger perform more efficiently
Household Income
- Takeaway: ZIPs where income-related signals are stronger (based on the model’s top feature) tend to be less cost-efficient.
  - CPA increases from Low to High quartiles (bins) (~$1.2 to ~$3.15)
  - Indicates stronger income signal areas are more expensive to convert
  - Lower-signal areas are more efficient
Age
- Takeaway: ZIPs with stronger age-related signals are more cost-efficient.
  - CPA decreases from Low to High quartiles
  - Indicates areas where age is more predictive deliver cheaper conversions

Table View

The table provides the underlying data behind the chart, showing how cost and performance vary across demographic category bins. Users can download this table as a CSV file for further analysis.

What It Shows:

Each row represents a demographic category and its top contributing sub-feature
Data is broken into quartile (bins): Low to High based on how strongly that feature is represented within each ZIP
Columns included:
- Category: High-level demographic group (e.g., Household Income, Age, Marital Status)
- Top Feature: The most important sub-feature within that category driving the segmentation (e.g., a specific income bracket or age group)
- Bin: Quartile grouping (Low, Medium-Low, Medium-High, High)
  - Represents relative concentration of that feature within a ZIP, not the value itself.
  - Bins are calculated independently for each category based on that category’s top feature.
- ZIP Count: Number of ZIP codes in that bin
- Impressions: Total impressions delivered across those ZIPs
- Conversions: Total conversions generated in those ZIPs
- Total Spend: Total media spend across those ZIPs

Lookalike

Lookalike targeting uses the trained demographic model to predict conversion rates for all ~33,000 US ZIP codes not in your campaign. ZIPs with demographics similar to your high-performing areas are identified as expansion targets. Confidence tiers reflect prediction reliability — high confidence ZIPs are within the model’s training distribution with consistent predictions across all decision trees.

This section is only available when your campaign is not running nationally and is not already serving all ZIP codes, as lookalike modeling requires out-of-sample areas for expansion.