So Hyun Kim Portfolio 👩🏻‍💻
Born in South Korea, raised in Hong Kong and educated in the US, I bring a dynamic and diverse perspective. With over two years in technology consulting, I specialize in program management, data analytics, and business intelligence, delivering robust, data-driven insights that drive decision-making.
Work Experience.

Ernst & Young
Data Analytics Team, Senior Technology Consultant
2019. 01 - 2021. 05
Global regulatory change management initiative for a large global bank's data risk management
SQL
Python
PowerBI
Oracle DB
Excel
VBA
Program Management
Cross-functional work
- Led program management for a regulatory change initiative, working cross-functionally with 7 teams and tracking the key performance metric for 17 countries to drive the project to completion.
- Managed the SQL server data mart on the Oracle Database for the APAC region, which held over 500,000 data upload templates, and developed SQL scripts for data cleansing and validation that reduced data errors and inconsistencies by ~25%.
- Eliminated manual submission of completed tasks by creating an automated solution in Excel VBA.
- Created a PowerBI progress-tracking dashboard from task-completion data that provided real-time insight into team progress across 17 countries and was used by over 20 senior leaders on the project.
- Onboarded new consultants onto the project by providing project background, demos, and hands-on training on the technologies used.
EY Data Analytics Competition: Identifying opportunities during COVID-19 for Insurance Companies
Python
bash
Data: Used 2 years of Twitter, news outlet, and insurance publication data to conduct the analysis below:
- Sentiment analysis: Used sentiment analysis in Python to understand the sentiment of customers towards the largest insurance company in Asia.
- Customer segmentation: Grouped customers by age to understand how behavior and sentiment towards the insurance company changed pre- and post-COVID.
- Used web scraping in Python to gather data from Twitter, news outlets, and the insurance company's website to understand how customers were feeling, how the government and society were responding to the situation, and what the insurance company was doing in response.
- Implemented a decision tree classifier to categorize the sentiment of text, achieving an accuracy score of 86%.
- Won 1st place out of 15 teams, which led to the team presenting to the Fin-tech association in Hong Kong and sparking a discussion about how businesses can use data analytics to respond during a time of hardship.
Data governance framework strategy proposal for a local pension trustee
agile
data governance
Program Management
Cross-functional Work
- Proposed an end-to-end data governance strategy pilot program for a pension trustee, leveraging data analytics to identify new opportunities for customers and to help the trustee achieve its goal of becoming a more data-driven organization.
- Led interactive workshops with 25+ individuals from 8 cross-functional teams to understand their current data landscape, data pain points, and data infrastructure to create a two-phase target state program that optimized data usage for all teams.
Women in Technology (WIT)
PowerBI
Python
SQL
- Acted as a data lead within the WIT group, guiding and managing data-related initiatives.
- Initiated and coordinated the launch of the first WIT group in Greater China, growing the team from 4 to 20 people within 3 months by securing executive sponsorship and developing networking sessions dedicated to empowering women.
- Organized and led internal training on SQL, Python, and PowerBI basics for 50+ employees.
Team Project.

JumboCode
2023. 09 - Present
This is a Tufts University student-run organization that aims to provide non-profits and charities in Boston with free software solutions.
Tailwind
React
Next.js
TypeScript
MongoDB
- Partnered with Bi-Women Quarterly, a publication based in Boston, to create a submission portal and a database, eliminating manual email submissions by users and reducing data processing by ~80%.
- Developed the front-end interface and submission components of the React application with a focus on visual appeal.
- Established a connection between the front end and the back-end MongoDB database to streamline the data submissions made by users.

Microsoft, Analytics Capstone Project
2022. 03 - 2022. 06
Conducted behavioral analysis on collaboration, employee account and revenue data to increase employee satisfaction and enhance collaboration culture at the company
Python
SQL
PowerBI
Using three years of workplace behavior data, account data, and revenue data, I conducted the following analysis:
- User segmentation: Segmented the sellers based on behavioral attributes to find four unique seller types using K-means clustering (k = 4)
- Data exploration: Used SQL and Python to conduct initial exploration and validation of dataset, and create new views based on the gaps identified.
- Regression analysis: Implemented simple and multiple linear regression models on the three years of data to identify behavioral attributes that contributed to the increase in revenue.
- PowerBI dashboard: Created a PowerBI dashboard with two main views - top performer vs low performer and industry-focused vs generalist sellers. The dashboard gave more insight into one of the three gaps identified.
- Workshops: Led workshops with sales teams to add context to the data and understand the pain points sellers were experiencing with regard to workplace behavior (e.g. meetings, calls, messages, and emails)
Personal Project.

Data Projects
2023. 09 - 2023. 12
This section lists projects that showcase my skills in data engineering and analytics. From designing robust data pipelines to extracting actionable insights, these projects reflect my ability to leverage big data technologies.
Python
Google Cloud Platform
Data Streaming
AWS
Apache Kafka
ETL
Data Modelling
API
Bash
Apache Spark
BigQuery
Hadoop
Yelp Review Data Pipeline 2023.11 - 2023.12
Visit My Project
- Engineered an ETL pipeline using Mage for secure data extraction from the Yelp Fusion API and efficient transfer between the local computer and Google Compute Engine
- Optimized data storage in Google Cloud Storage and BigQuery through indexing and queries, enhancing data efficiency by ~40%
- Developed a Looker Dashboard to analyze New York City's restaurant industry, delivering critical business insights and conducting user sentiment analysis
Real-time Stock Price Analysis 2023.11
Visit My Project
- Implemented a real-time data pipeline using Apache Kafka for streaming simulated stock market data, reducing data transfer time to AWS EC2 consumer by ~80%
- Configured AWS S3 for data storage, AWS Glue for transformation and ETL processing and AWS Athena for complex queries, collectively enhancing complex data analysis capabilities and accelerating time-to-insight
Web Server Log Analysis 2023.11
Visit My Project
- Created a Python program that randomly generates web server logs for analysis
- Analyzed Apache web server logs using Spark RDDs to determine which IP addresses generate the most client errors and to provide a breakdown of request types and distribution of five responses, resulting in a deep understanding of relevant errors
- Provisioned a cluster on Google Cloud Platform (Dataproc) and moved the data into HDFS to run the PySpark job, reducing the workload on the local computer and increasing efficiency
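As a rough illustration of the aggregation described in this project, here is a plain-Python sketch (not the PySpark job itself) that counts client errors (4xx) per IP address and tallies request types and response codes. The log lines, the `LOG_PATTERN` regex, and the `summarize_logs` helper are hypothetical and assume the Apache Common Log Format.

```python
import re
from collections import Counter

# Assumed Apache Common Log Format:
# IP identity user [timestamp] "METHOD path protocol" status size
LOG_PATTERN = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] '
    r'"(?P<method>\S+) \S+ [^"]*" (?P<status>\d{3}) \S+'
)

def summarize_logs(lines):
    """Return (client errors per IP, counts per request method, counts per status)."""
    errors_by_ip = Counter()
    methods = Counter()
    statuses = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # skip lines that do not fit the assumed format
        status = int(match.group("status"))
        methods[match.group("method")] += 1
        statuses[status] += 1
        if 400 <= status < 500:  # 4xx responses are client errors
            errors_by_ip[match.group("ip")] += 1
    return errors_by_ip, methods, statuses

# Made-up sample data, for illustration only
SAMPLE_LOGS = [
    '10.0.0.1 - - [01/Nov/2023:10:00:00 +0000] "GET /index.html HTTP/1.1" 200 1024',
    '10.0.0.2 - - [01/Nov/2023:10:00:01 +0000] "GET /missing HTTP/1.1" 404 512',
    '10.0.0.2 - - [01/Nov/2023:10:00:02 +0000] "POST /login HTTP/1.1" 403 128',
    '10.0.0.3 - - [01/Nov/2023:10:00:03 +0000] "GET /api HTTP/1.1" 500 64',
]

errors_by_ip, methods, statuses = summarize_logs(SAMPLE_LOGS)
print(errors_by_ip.most_common(1))  # IP with the most client errors
print(dict(methods))                # breakdown of request types
```

In Spark, the same per-line parsing would typically sit in a map step, with the counting done by a key-based reduction over the RDD.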

Java Applications
2022. 09 - 2022. 12
This section lists out projects in Java that were completed in the Software Engineering course at Tufts University.
Java
Java on Rails
Visit My Project
- Implemented a web server system called Jrails, leveraging ideas from Ruby on Rails, that uses a model-view-controller architecture
- Used Gradle to build, run, and test Jrails and the application, increasing development efficiency and ensuring a streamlined build process
Unit Testing Framework
Visit My Project
- Developed a number of testing framework components, leveraging ideas from JUnit, increasing code reliability
- Implemented QuickCheck, an automated program testing technique, increasing efficiency in testing and avoiding manual testing
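To show the idea behind QuickCheck-style testing, here is a minimal sketch in Python (the course project itself was written in Java). It runs a property against randomly generated inputs and reports the first counterexample it finds. The names `check_property`, `random_int_list`, and the two example properties are hypothetical and are not taken from the project.

```python
import random

def check_property(prop, generator, trials=200, seed=0):
    """Run prop on random inputs; return the first failing input, or None."""
    rng = random.Random(seed)  # seeded so runs are reproducible
    for _ in range(trials):
        value = generator(rng)
        if not prop(value):
            return value
    return None

def random_int_list(rng, max_len=10):
    """Generate a list of 0..max_len random integers."""
    return [rng.randint(-100, 100) for _ in range(rng.randint(0, max_len))]

def reverse_twice_is_identity(xs):
    # A property that should hold for every list
    return list(reversed(list(reversed(xs)))) == xs

def sorted_equals_input(xs):
    # A deliberately false property, used to show a counterexample being found
    return sorted(xs) == xs

print(check_property(reverse_twice_is_identity, random_int_list))
counterexample = check_property(sorted_equals_input, random_int_list)
print(counterexample)
```

The first call is expected to find no counterexample, while the second should return an unsorted list that falsifies the property.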

Web Application Development
2023. 09 - 2023. 12
This section lists web development projects that showcase my skill in creating engaging, intuitive front-end interfaces from scratch.
Tailwind
Next.js
TypeScript
JumboCode 2023.09 - Present
Visit Website
- Developed front-end interface of React Application dedicated to a submission portal, reducing data processing time by ~60%
- Established connection between front-end submission interface and the backend database using MongoDB configuration, reducing manual workload by ~80%

iOS Application
2023. 09 - 2023. 12
This section lists iOS applications that showcase my ability to code in Swift and use the Composable Architecture to create visually appealing and cohesive iOS applications.
Swift
Composable Architecture
Xcode
Conway's Game of Life
Visit my Project
- Built a Conway's Game of Life simulator with a visualization for iOS, written in Swift
- Utilized SwiftUI and the Composable Architecture framework by Point-Free
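For reference, the core rule of Conway's Game of Life can be sketched in a few lines. The example below is in Python rather than Swift and is not the app's code. It assumes an unbounded grid represented as a set of live `(row, col)` cells, and the `step` function name is hypothetical.

```python
from collections import Counter

def step(live_cells):
    """Advance one generation; live_cells is a set of (row, col) tuples."""
    # Count how many live neighbors each cell has
    neighbor_counts = Counter(
        (r + dr, c + dc)
        for r, c in live_cells
        for dr in (-1, 0, 1)
        for dc in (-1, 0, 1)
        if (dr, dc) != (0, 0)
    )
    # A cell is alive next generation with exactly 3 live neighbors,
    # or with exactly 2 if it is already alive
    return {
        cell
        for cell, count in neighbor_counts.items()
        if count == 3 or (count == 2 and cell in live_cells)
    }

# A horizontal "blinker" flips to vertical and back every generation
blinker = {(1, 0), (1, 1), (1, 2)}
next_gen = step(blinker)
print(sorted(next_gen))  # [(0, 1), (1, 1), (2, 1)]
```

Storing only the live cells keeps the update independent of any fixed board size; a bounded board for display would clip the result to the visible area.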
Education.
University of Hong Kong
2014. 09 - 2018. 12
Location: Hong Kong
Degree: School of Business and Economics, Bachelor of Business Administration in Accounting and Finance
GPA: (3.67/4.0)

University of Washington
2021. 06 - 2022. 06
Location: Seattle, WA
Degree: Michael G. Foster School of Business, Master of Business Analytics (MSBA)
GPA: (3.75/4.0)

Tufts University
2022. 09 - 2024. 05
Location: Boston, MA
Degree: School of Engineering, Master of Computer Science
GPA: (3.91/4.0)
Certificates.
AZ900 - Azure Cloud Certificate
2019. 07. 28
Microsoft