Data Analysis

# Topic Completed (%)
1 Introduction to Data Analysis (Data Analysis Basics)
2 Understanding Data Types (Structured, Unstructured, Semi-Structured)
3 Understanding Data Sources (Databases, APIs, Files, Web Scraping)
4 Statistics Basics (Mean, Median, Mode, Variance, Standard Deviation)
5 Probability Basics (Probability Rules, Distributions, Conditional Probability)
6 Descriptive Statistics (Summarizing & Visualizing Data)
7 Inferential Statistics (Hypothesis Testing, Confidence Intervals, p-value)
8 Python for Data Analysis (Python Basics)
9 Python Libraries for Data Analysis (NumPy, Pandas, Matplotlib, Seaborn)
10 Data Cleaning & Preprocessing (Handling Missing Data, Outliers, Duplicates)
11 Data Transformation (Scaling, Encoding, Normalization, Aggregation)
12 Exploratory Data Analysis (EDA) (Descriptive Stats, Visualization, Correlation Analysis)
13 Data Visualization (Matplotlib, Seaborn, Plotly, Tableau, Power BI)
14 Charts & Graphs (Line, Bar, Pie, Histogram, Scatter, Box Plot)
15 Dashboarding & Reporting (Tableau, Power BI, Google Data Studio)
16 SQL for Data Analysis (Basics of SQL)
17 Advanced SQL Queries (Joins, Subqueries, Aggregations, Window Functions)
18 Data Extraction & Manipulation (Filtering, Sorting, Grouping)
19 Time Series Analysis (Trends, Seasonality, Forecasting)
20 Correlation & Regression Analysis (Linear, Multiple Regression)
21 Hypothesis Testing (t-test, chi-square test, ANOVA)
22 Python Advanced Libraries (SciPy, Statsmodels, Scikit-learn Basics)
23 Feature Engineering & Selection (Creating & Selecting Features for Analysis)
24 Data Modeling Basics (Introduction to Predictive Analytics)
25 Clustering & Segmentation (K-Means, Hierarchical Clustering)
26 Classification Basics (Decision Trees, Logistic Regression)
27 Working with APIs & External Data Sources
28 Web Scraping (BeautifulSoup, Scrapy)
29 Big Data Basics (Hadoop, Spark Overview)
30 Data Cleaning & Preprocessing in Big Data (PySpark Basics)
31 Data Ethics & Privacy (GDPR, CCPA, HIPAA Basics)
32 Best Practices in Data Analysis (Documentation, Reproducibility, Version Control)
33 Continuous Learning (Advanced Visualization, Predictive Modeling, AI/ML Basics for Analysts)