Rfm Customer Segmentation
by liangdabiao
Perform RFM (Recency, Frequency, Monetary) customer segmentation analysis on e-commerce data. Use when you need to analyze customer value, identify VIP customers, or create marketing segments. Automatically cleans data, calculates RFM metrics, applies K-means clustering, and generates visualization reports with Chinese language support.
Skill Details
Repository Files
12 files in this skill directory
name: rfm-customer-segmentation description: Perform RFM (Recency, Frequency, Monetary) customer segmentation analysis on e-commerce data. Use when you need to analyze customer value, identify VIP customers, or create marketing segments. Automatically cleans data, calculates RFM metrics, applies K-means clustering, and generates visualization reports with Chinese language support. allowed-tools: Read, Write, Bash, Glob
RFM Customer Segmentation Analysis
A comprehensive customer segmentation skill that automatically analyzes e-commerce transaction data to identify customer value segments using RFM (Recency, Frequency, Monetary) analysis with K-means clustering.
Instructions
1. Data Analysis
When users provide e-commerce data or ask about customer segmentation:
- Load and validate the transaction data
- Clean data by removing invalid orders (negative quantities, zero prices)
- Calculate RFM metrics for each customer:
- Recency: Days since last purchase
- Frequency: Number of purchases
- Monetary: Total purchase amount
- Use K-means clustering on RFM dimensions
- Automatically determine optimal number of clusters using elbow method
2. Customer Segmentation
- Create customer value segments: High, Medium, Low value customers
- Score each customer on RFM dimensions (1-3 scale)
- Calculate overall customer value scores
- Identify and rank VIP customers for marketing campaigns
3. Visualization and Reporting
- Generate comprehensive customer segmentation dashboard
- Create pie charts for segment distribution and revenue share
- Build RFM scatter plots to visualize customer patterns
- Generate box plots showing value distribution by segment
- Export detailed CSV reports with VIP customer lists
4. Marketing Insights
- Provide actionable marketing recommendations for each segment
- Generate executive summary with key findings
- Create customer activation strategies for different value tiers
- Export VIP customer lists for targeted marketing campaigns
Usage Examples
Basic Customer Segmentation
Analyze these e-commerce orders and segment customers by value:
[CSV data with order_id, user_id, purchase_date, quantity, unit_price]
VIP Customer Identification
Find the top 100 most valuable customers from our sales data for marketing campaign
Customer Value Analysis
Create a customer segmentation report showing revenue contribution by customer segment
Key Features
- Automatic Data Cleaning: Handles Chinese e-commerce data formats, removes invalid orders
- Intelligent Clustering: Uses elbow method to determine optimal cluster count
- Chinese Language Support: Full support for Chinese field names and visualizations
- Comprehensive Reports: Generates HTML reports, PNG dashboards, and CSV exports
- Marketing Ready: Provides VIP customer lists and actionable insights
File Requirements
The skill works with e-commerce transaction data containing:
- user_id: Customer identification code (用户码)
- order_date: Purchase date (消费日期)
- quantity: Order quantity (数量)
- unit_price: Item unit price (单价)
- product_info: Product details (optional)
Output Files Generated
customer_segments.csv: Complete customer segmentation datavip_customers_list.csv: Ranked VIP customer list for marketingsegment_summary_statistics.csv: Detailed statistics by segmentcustomer_segmentation_dashboard.png: Visual analytics dashboarddata_validation_report.txt: Data quality and analysis validation
Dependencies
- pandas, numpy for data processing
- scikit-learn for K-means clustering
- matplotlib, seaborn for visualization (with Chinese font support)
- Standard Python libraries for file operations
Best Practices
- Ensure date fields are in consistent format (YYYY-MM-DD recommended)
- Remove or handle missing values before analysis
- Use sufficient data volume (1000+ orders recommended for reliable clustering)
- Consider business context when interpreting segment results
- Validate results with domain knowledge when possible
Related Skills
Xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Clickhouse Io
ClickHouse database patterns, query optimization, analytics, and data engineering best practices for high-performance analytical workloads.
Analyzing Financial Statements
This skill calculates key financial ratios and metrics from financial statement data for investment analysis
Data Storytelling
Transform data into compelling narratives using visualization, context, and persuasive structure. Use when presenting analytics to stakeholders, creating data reports, or building executive presentations.
Kpi Dashboard Design
Design effective KPI dashboards with metrics selection, visualization best practices, and real-time monitoring patterns. Use when building business dashboards, selecting metrics, or designing data visualization layouts.
Dbt Transformation Patterns
Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices.
Sql Optimization Patterns
Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.
Anndata
This skill should be used when working with annotated data matrices in Python, particularly for single-cell genomics analysis, managing experimental measurements with metadata, or handling large-scale biological datasets. Use when tasks involve AnnData objects, h5ad files, single-cell RNA-seq data, or integration with scanpy/scverse tools.
Xlsx
Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.
