Kai-Yin Huang

Your Photo

Kai-Yin Kate Huang

Master of Information Technology and Analytics in Rutgers University

katehuang.work@gmail.com

ABOUT Kate

I'm a graduate student specializing in data-driven decision-making, with a strong foundation in machine learning, data analysis, and cloud computing. My experience spans AI, data engineering, and cloud technologies, which allows me to tackle complex challenges and develop innovative solutions that drive organizational success. With a passion for applying data insights to real-world problems, I excel at bridging the gap between technical expertise and business strategy. My goal is to leverage these skills to support forward-thinking organizations in making informed, data-backed decisions that promote growth and efficiency.

SKILLS

Programming Languages

Python
SQL
R
Flask
JavaScript

Machine Learning & Data Analysis

SVM
RandomForest
Linear Regression
BERT
NLP
LLM
RAG
Neural Networks (Keras, TensorFlow)

Data Processing & Text Mining

Jieba
NLTK
Web Scraping
APIs

Visualization Tools

Tableau
PowerBI
Matplotlib
Plotly
Microsoft Excel

WORK EXPERIENCES

Rutgers, The State University of New Jersey

Jul 2024 - Present

Research Assistant

  • Conducted large-scale feature selection using Lasso regression and Random Forest on Fitbit data, enhancing predictive model accuracy through 10,000 iterations of analysis
  • Designed and implemented a data pipeline using Python (Pandas, NumPy, Plotly) to analyze and visualize COVID-19 and general mortality trends across global cities
  • Enhanced predictive model accuracy by 25% by pioneering the development of interaction-based linear models, leveraging advanced tree-based methods in Python and R
  • Applied deep learning and SVM algorithms to tackle complex data-driven problems, transforming research outcomes with actionable insights

Vosyn Inc.

Sep 2024 - Nov 2024

Backend Developer Intern

  • Spearheaded the development of robust backend APIs with Django, reducing data retrieval time by 40% for thousands of users accessing real-time data across the platform, while implementing Git version control best practices to streamline team collaboration and code deployment
  • Elevated data processing speed by 30% by building and refining RESTful APIs, ensuring smooth and instant interaction between frontend and backend even during peak traffic

Winbond Electronics Corporation

Jul 2024 - Aug 2024

AI / Data Engineering Intern

  • Designed and delivered dynamic weekly reports in SSRS and Power BI by transforming SAP data into EDW, creating tailored dashboards and visualizations to address client-specific requirements, and ensuring seamless, automated delivery
  • Architected an end-to-end course recommendation system by implementing web crawlers to gather data from three platforms, and leveraging NLP techniques (TF-IDF, word2vec) to build a machine learning-powered engine that improved recommendation relevance by 85% for department-specific training needs
  • Created a Flask-based web server with integrated OpenAI Assistants API, reducing manual input for data engineers by 60% by automating SQL assistance and SAP column naming
  • Demonstrated exceptional presentation and documentation skills by delivering comprehensive technical reports and engaging presentations, ranking 2nd among 15 interns in the final evaluation for clear communication of complex technical concepts and project achievements

TungHai University

Sep 2022 - Jun 2023

Teaching Assistant for Statistics

  • Mentored over 50 students, breaking down complex statistical problems into digestible concepts, resulting in a 38% improvement in students’ exam scores and overall course comprehension
  • Analyzed student performance data using statistical techniques, delivering insights that helped professors adjust their teaching strategies, improving the course pass rate by 95%

PROJECTS

SQL Database Development Project: Fitness Coaching Platform

Engineered a containerized PostgreSQL database system for a fitness coaching platform, implementing relational data structures and complex SQL queries to streamline booking management and performance analytics.

View Project

Socioeconomic and Demographic Analysis of COVID-19 Case Rates in NY

Conducted ZIP code-level analysis of COVID-19 transmission patterns in New York State, utilizing regression modeling to identify key socioeconomic and demographic predictors of infection rates (R² = 0.840).

View Project

NBA Database

Designed an optimized SQL database for the NBA 2009 dataset, implementing ETL processes and query optimization techniques.

View Project

How “Hallyu” Impacts the World

Conducted statistical analyses and created visualizations using Tableau and Python to assess the global impact of South Korea's entertainment industry.

View Project

Accenture North America Data Analytics

Analyzed datasets and created Tableau visualizations for a social media client, improving data insights.

THU Library System

Created a comprehensive system using MS Access, enhancing library efficiency through book management and user tracking.

View Project

Titanic Analytics

Constructed predictive models for survival likelihood, identifying XGBoost as the best performer with 82.12% accuracy.

View Project

LINE Bot - Taiwan Weather Tool

A LINE Bot built using Node.js to provide weather information for Taiwan's cities.

View Project

EDUCATION

Rutgers, The State University of New Jersey

Sep 2023 - Dec 2024

Master of Information Technology and Analytics

TungHai University

Sep 2020 - Jun 2024

Bachelor of Business Administration, Major in Information Management