Rfm Customer Segmentation

by liangdabiao

data

Perform RFM (Recency, Frequency, Monetary) customer segmentation analysis on e-commerce data. Use when you need to analyze customer value, identify VIP customers, or create marketing segments. Automatically cleans data, calculates RFM metrics, applies K-means clustering, and generates visualization reports with Chinese language support.

Skill Details

Repository Files

12 files in this skill directory


name: rfm-customer-segmentation description: Perform RFM (Recency, Frequency, Monetary) customer segmentation analysis on e-commerce data. Use when you need to analyze customer value, identify VIP customers, or create marketing segments. Automatically cleans data, calculates RFM metrics, applies K-means clustering, and generates visualization reports with Chinese language support. allowed-tools: Read, Write, Bash, Glob

RFM Customer Segmentation Analysis

A comprehensive customer segmentation skill that automatically analyzes e-commerce transaction data to identify customer value segments using RFM (Recency, Frequency, Monetary) analysis with K-means clustering.

Instructions

1. Data Analysis

When users provide e-commerce data or ask about customer segmentation:

  • Load and validate the transaction data
  • Clean data by removing invalid orders (negative quantities, zero prices)
  • Calculate RFM metrics for each customer:
    • Recency: Days since last purchase
    • Frequency: Number of purchases
    • Monetary: Total purchase amount
  • Use K-means clustering on RFM dimensions
  • Automatically determine optimal number of clusters using elbow method

2. Customer Segmentation

  • Create customer value segments: High, Medium, Low value customers
  • Score each customer on RFM dimensions (1-3 scale)
  • Calculate overall customer value scores
  • Identify and rank VIP customers for marketing campaigns

3. Visualization and Reporting

  • Generate comprehensive customer segmentation dashboard
  • Create pie charts for segment distribution and revenue share
  • Build RFM scatter plots to visualize customer patterns
  • Generate box plots showing value distribution by segment
  • Export detailed CSV reports with VIP customer lists

4. Marketing Insights

  • Provide actionable marketing recommendations for each segment
  • Generate executive summary with key findings
  • Create customer activation strategies for different value tiers
  • Export VIP customer lists for targeted marketing campaigns

Usage Examples

Basic Customer Segmentation

Analyze these e-commerce orders and segment customers by value:
[CSV data with order_id, user_id, purchase_date, quantity, unit_price]

VIP Customer Identification

Find the top 100 most valuable customers from our sales data for marketing campaign

Customer Value Analysis

Create a customer segmentation report showing revenue contribution by customer segment

Key Features

  • Automatic Data Cleaning: Handles Chinese e-commerce data formats, removes invalid orders
  • Intelligent Clustering: Uses elbow method to determine optimal cluster count
  • Chinese Language Support: Full support for Chinese field names and visualizations
  • Comprehensive Reports: Generates HTML reports, PNG dashboards, and CSV exports
  • Marketing Ready: Provides VIP customer lists and actionable insights

File Requirements

The skill works with e-commerce transaction data containing:

  • user_id: Customer identification code (用户码)
  • order_date: Purchase date (消费日期)
  • quantity: Order quantity (数量)
  • unit_price: Item unit price (单价)
  • product_info: Product details (optional)

Output Files Generated

  • customer_segments.csv: Complete customer segmentation data
  • vip_customers_list.csv: Ranked VIP customer list for marketing
  • segment_summary_statistics.csv: Detailed statistics by segment
  • customer_segmentation_dashboard.png: Visual analytics dashboard
  • data_validation_report.txt: Data quality and analysis validation

Dependencies

  • pandas, numpy for data processing
  • scikit-learn for K-means clustering
  • matplotlib, seaborn for visualization (with Chinese font support)
  • Standard Python libraries for file operations

Best Practices

  • Ensure date fields are in consistent format (YYYY-MM-DD recommended)
  • Remove or handle missing values before analysis
  • Use sufficient data volume (1000+ orders recommended for reliable clustering)
  • Consider business context when interpreting segment results
  • Validate results with domain knowledge when possible

Related Skills

Xlsx

Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas

data

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Clickhouse Io

ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.

datacli

Analyzing Financial Statements

This skill calculates key financial ratios and metrics from financial statement data for investment analysis

data

Data Storytelling

Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.

data

Kpi Dashboard Design

Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.

designdata

Dbt Transformation Patterns

Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.

testingdocumenttool

Sql Optimization Patterns

Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.

designdata

Anndata

This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.

arttooldata

Xlsx

Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.

tooldata

Skill Information

Category:Data
Allowed Tools:Read, Write, Bash, Glob
Last Updated:12/24/2025