My Professional Experiences

Summary

Data Scientist with expertise in machine learning, data analysis, and building predictive models using Python, Scikit-learn, and PyTorch. Skilled in developing ETL pipelines, working with cloud platforms (AWS, GCP), and automating workflows. Proven ability to collaborate with cross-functional teams to deliver data-driven insights, improve conversion rates, and enhance user engagement.

Skills

  • Programming Languages: Python, R, SQL, HTML, CSS, SwiftUI
  • Python Libraries: NumPy, Pandas, Polars, Scikit-learn, PyTorch, TensorFlow, PySpark, Matplotlib, Seaborn, NLTK
  • Data Analysis & Visualization Tools: Excel, Tableau, Mixpanel, Superset
  • Databases: MongoDB, MySQL, Redis, Google BigQuery, Hive, Milvus
  • Cloud & DevOps: GitLab, Linux, Amazon EC2, Amazon RDS, Google Cloud Platform (Cloud Storage), Docker
  • Industry Skills: Data Analysis, ETL Pipelines, Product Sense, Team Collaboration, Project Management

Professional Experience

Data Scientist

Company: The Epoch Times, New York, NY
Dates: Jun. 2022 - Present

  • Developed predictive models to enhance user engagement and subscription rates.
  • Implemented personalized content recommendation systems for email newsletter users.
  • Collaborated with cross-functional teams to improve data tracking and analytics processes.
  • Analyzed user data to optimize app engagement strategies.
  • Presented key insights to senior management to inform strategic decisions.

Data Science Intern

Company: Deledao, Santa Clara, CA
Dates: Sep. 2021 - May 2022

  • Developed machine learning models to classify and categorize web content.
  • Enhanced content analysis processes to support educational initiatives.

Investment Analyst (Part-Time)

Company: Brunswick Advisers, New York, NY
Dates: May 2021 - Present

  • Automated trading processes and portfolio management tasks using programming tools.
  • Developed pipelines for testing and analyzing trading strategies.
  • Conducted data analysis to identify investment opportunities.
  • Prepared data-driven presentations to inform investment decisions.

Projects

Machine Learning Projects

  • Used Car Price Prediction

    Description: Developed a machine learning model to predict used car prices using extensive feature engineering and hyperparameter tuning. Achieved a final RMSE of 72239.79 with LightGBM on the Kaggle leaderboard, using attributes such as brand, model, year, and mileage.

    Skills: Python, LightGBM, Optuna, Feature Engineering, Data Preprocessing, Hyperparameter Tuning, Model Evaluation (RMSE), Data Visualization (Seaborn, Matplotlib)

  • Wine Sommelier Analysis (Part I & II)

    Description: Built a machine learning model using NLP to predict wine grape varieties based on sommelier descriptions. Applied Random Forest and XGBoost classifiers with text preprocessing and hyperparameter tuning.

    Skills: Python, spaCy, Scikit-learn, Selenium, NLP, Random Forest, XGBoost, GridSearchCV, Data Cleaning, Feature Engineering

  • Stock Price Prediction using Python

    Description: Developed a stock price prediction model that uses Selenium to scrape the most active stock from Yahoo Finance and retrieves historical data from the IEX Cloud API. Applied Linear Regression to predict future stock prices and automated email notifications of the results.

    Skills: Python, Selenium, Linear Regression, Web Scraping, IEX Cloud API, Machine Learning, Data Preprocessing, Data Extraction, Stock Price Forecasting, Automation

  • Sentiment Analysis on Amazon Reviews

    Description: Analyzed Amazon product reviews using text preprocessing, word cloud visualizations, and logistic regression to classify sentiment as positive or negative.

    Skills: Python, Sentiment Analysis, WordCloud, Logistic Regression, Seaborn, Scikit-learn, Data Cleaning, Data Visualization, Text Classification, Natural Language Processing (NLP), Machine Learning


Tableau Projects

  • British Airways Reviews Dashboard

    Description: A dashboard providing insights into British Airways reviews, allowing users to explore reviews by seat type, traveler type, and aircraft type to analyze customer satisfaction.

    Skills: Tableau, Data Visualization

  • Data Science Job Salaries Dashboard

    Description: A dashboard analyzing data science job salaries across various countries, job titles, experience levels, and employment types, offering valuable insights into salary trends.

    Skills: Tableau, Data Analysis


Excel Projects

  • Coffee Sales Dashboard in Excel

    Description: An interactive Excel dashboard that provides insights into coffee sales across different regions, customer types, and coffee varieties. It allows users to explore sales trends over time and identify top-selling coffee types and loyal customers.

    Skills: Excel, Data Visualization, XLOOKUP, INDEX MATCH, Pivot Tables, Pivot Charts


Cloud Projects

  • Flight Booking Website - Cloud Project

    Description: Demonstration of a complete flight booking website built using Python Flask and AWS. The project covers user authentication, booking management, and cloud integration.

    Skills: Python Flask, AWS, Full-Stack Development

  • Hosting a Simple HTML Page on Azure

    Description: A video walkthrough of how to host a simple HTML page on Microsoft Azure, covering the basics of deploying static web content to the cloud.

    Skills: Azure, Cloud Hosting


Coursework

  • Machine Learning
  • Data Mining
  • Cloud Computing and Big Data
  • Database
  • Statistical Computing and Graphics
  • Data Structure
  • Linear Regression
  • Statistical Theory and Method
  • Business Data Analytics