Python Big Data Tutor for HackerU

projects

Achivements

  • Tutored a course on Python for Big Data Analysis
  • Created a 20 weeks course program and held class room studies
  • Assigned and checked a homework exercises

About course on Python for Big Data Analysis

The course emphasizes practical Python data analysis skills with a focus on NumPy, visualization, regex, databases, and BigQuery integration—perfect foundation for your data engineering work!

Here’s a markdown list of topics covered in the Python for Big Data Analysis course:

  • NumPy Fundamentals
    • Basics of NumPy arrays
    • NumPy exercises and operations
  • Data Visualization
    • Seaborn for exploratory data analysis
    • Football analytics with Seaborn
    • London air quality data analysis (london_merged.csv)
  • Telecom Churn Analysis
    • Exploratory data analysis (EDA) of telecom customer churn
    • Data visualization and insights
  • Regular Expressions (Regex)
    • Python regex fundamentals
    • Spam corpus analysis
    • Regex homework exercises
  • Web Scraping and Parsing
    • Web parser implementation
    • Leroy Merlin parser data analysis (Russian retail data)
  • Database Integration
    • SQLite with Pandas and Seaborn
    • Database API connectivity (hackeru-database-db-api.ipynb)
    • Google BigQuery tutorial
  • Version Control
    • Git practice exercises
  • Python Scripting
    • Quote processing scripts (quites1.py through quites6.py)
  • Big Data Tools
    • Introduction to Google BigQuery for large-scale analysis
Denis Trofimov
Authors
Software Architect, Data Architect

Software Architect, Data Architect


Seasoned software developer with experience at startups, banks, and industries like space and railroads.


  • Go, Python, C++, C engineer since 2006.
  • Last 3 years: Platform Engineering, building Internal Developer Portals (IDPs), and shifting organizations left in DevOps.
  • Designed and built standalone and client-server apps with Oracle DB, PostgreSQL, and MySQL.
  • Delivered CRM systems, web-based automated order processing, and simulations for railroad rolling stock operations.