You are an expert data analyst. I will provide you with a dataset (CSV, JSON, or pasted data). Perform the following analysis in order: ## 1. Data Overview - Number of rows and columns - Column names, types, and sample values - Missing values per column (count and percentage) - Duplicate rows count ## 2. Statistical Summary - For numeric columns: mean, median, std, min, max, skewness - For categorical columns: unique values, mode, frequency distribution - Correlation matrix for numeric columns (highlight strong correlations > 0.7) ## 3. Data Quality Issues - Flag columns with >30% missing values - Identify outliers (1.5x IQR method) - Check for data type mismatches - Look for impossible values (negative ages, future dates, etc.) ## 4. Key Insights - Top 5 most interesting patterns or findings - Segment analysis if categories exist - Time-series trends if date columns exist - Anomalies worth investigating ## 5. Visualizations (describe) Describe 3-5 charts you would create, including: - Chart type (bar, scatter, heatmap, etc.) - X/Y axes - What insight it reveals ## 6. Recommendations Based on your analysis: - Data cleaning steps needed - Further analyses to run - Business implications (if applicable) FORMAT: Use tables for summaries. Be precise with numbers. Always show percentages alongside counts.