Header Image

--Introduction

Jonathan Lacanlale's Portfolio

My name is Jonathan Lacanlale, and I was born and raised in Los Angeles, CA. I graduated in 2021 from Cal State University, Northridge (CSUN) with a B.S. in Computer Science and Minor in Mathematics, and I'm currently pursuing my Masters in Data Science at U.T. Austin.

As of March '24, I've been working full-time as an Analytics Engineer for the mission-driven non-profit known as Didi Hirsch, which focuses on providing mental health services to all communities. Here, I'm fortunate to take both ownership and creative ability in developing data tools that enable both clinical and internal teams to make data-driven decisions.

My long-term work passion is to pursue a career that focuses on working with data from a technological and analytical perspective. I am continually cultivating this ambition into a data science-centered career that will allow me to apply my professional skillset to missions that I believe in.

Work Experience

Throughout my career, I've been fortunate to work with multiple domains of data, varying from financial data, biomedical research, image and videos, textual data, and even data fed from raw sensors. For each project and position I've been involved in, I've had a large stake in both the technical and analytical work. This includes ETL/ELT pipeline development, serving adhoc data requests through SQL, creating data visualizations/dashboards, communicating data insights, and much more.

These experiences continue to lead me to pursue my passion in data-centric work and further support my growth as a data scientist.

Analytics Engineer

Didi Hirsch, Mar '24 - Present

Enabling streamlined decision making through improved data reporting efforts and supporting clinicians and internal staff as the primary Analytics Engineer within the Corporate Reporting team.

Key Responsibilites:

  • - Spearhead the development and maintenance of dataflows to support multiple teams and staff in a cross-functional manner.
  • - Take ownership of the development and design of data dashboards on multiple platforms used to both automate data retrieval process, as well as provide insight into ongoing performance.
  • - Meet with teams in a cross-functional and collaborative manner to improve data reporting capabilities, and innovate in ways that reduce operational bottlenecks.
  • - Conduct research and development of core KPI's and visuals that contribute to the optimization of business performance from both a financial and operational perspective.
  • - Created documentation that clarified contractual obligations, restraints, technical processes, and key points-of-logic which was understood by staff at varying levels of management (including C-Suite/Executive).

      Primary Tools:
      • - Snowflake SQL
      • - Microsoft SQL Server
      • - Microsoft PowerBI, DAX
      • - Microsoft PowerApps
      • - Microsoft Excel

Data QA Analyst

Enervee, Aug '21 - Jan '24

Sat within the core data team among software and energy engineers to develop data monitoring systems, improve overall data QA across the product vertical, and take lead on data stewardship activities to support an organization's mission in improving energy efficiency at the consumer level.

Key Responsibilites:

  • - Increased data retrieval and manipulation efficiency by facilitating the integration of Django ORM to streamline database interactions
  • - Assembled large, normalized data lakes by constructing data (ETL) pipelines capable of ingesting and transforming data from over 20+ disparate data sources using Python, Django, and S3
  • - Collaborated with external vendors and product stakeholders to understand data capabilities and product scope, leading to the delivery of detailed feature proposals and product roadmaps
  • - Architected a data platform for managing over 10000+ consumer-facing products for an eCommerce catalog sourced from multiple 3rd party resources
  • - Utilized Python, Looker, and SQL to engineer data reporting systems for consistent dashboard and visualization delivery to communicate business insights and pipeline performance to stakeholders
  • - Designed data visualizations and analyses to identify root cause of data issues leading to the improvement of data quality across a large consumer-facing eCommerce catalog
  • - Engineered complex SQL queries to serve ad-hoc requests to enable product stakeholders in making data-driven decisions

      Primary Tools:
      • - Python (Jupyter, Pandas, Numpy, Django)
      • - PostgreSQL
      • - Docker
      • - Git
      • - Google Looker Studio

Research Software Engineer

CSUN Computer Vision and Image Processing Lab, Aug '19 - Jun '21

Took ownership in researching the feasibility of using computer vision models for quantifying treatment agents to streamline the workflow of biomedical resreachers.

Key Responsibilites:

  • - Constructed end-to-end data pipelines using Python and Pandas to enhance preprocessing and ingestion rates capable of producing an image dataset of 1000+ images and reducing manual workload by 70%
  • - Independently spearheaded the development of 3+ common classification machine learning models for comparison of performance and presentation at multiple scientific conferences
  • - Mentored junior researchers in utilizing data visualization tools in order to scale collaborative research efforts

      Primary Tools:
      • - Python (Jupyter, Pandas, Numpy, OpenCV)
      • - MATLAB
      • - Google Colab
      • - Git

Research Analyst

University of Houston, Jun '20 - Aug '20

Contributed to ongoing cybersecurity research in understanding the processing and visualization of network packet data between cloud systems.

Key Responsibilites:

  • - Utilized remote Amazon Web Service instances across the domestic United States to simulate remote connection chains on a secure network
  • - Analyzed network packet information later used by machine learning classification models such as Random Forests and support vector machines
  • - Conducted in-depth analysis of network data to identify connection chain lengths between secure shell connections

      Primary Tools:
      • - AWS
      • - Python (pandas, numpy, sci-kit-learn)
      • - Google Slides

Research Software Engineer

UCSD Data Analytics Lab, Jun '19 - Aug '19

Took lead in researching the stepping-stone component of a larger reserach initiative. The results of my work in contribution to the astute academic scholars lead to the early comparitive models of automated data categorization systems using machine learning and AI. This work was recognized for benchmark comparisons in early development models by several companies such as Amazon, OpenAI, and Meta.

Key Responsibilites:

  • - Scaled internal data storage systems through the systemization of data maintainability and implementation of strict data usage guidelines
  • - Conducted thorough investigations into the data quality, integrity, and relevancy across 10+ external data sources, ensuring reliable and accurate information is used for research purposes
  • - Successfully programmed 5+ text classification machine learning models through Python, scikit-learn, and Tensorflow, allowing for publication of insights at multiple scientific conferences and AI research groups
  • - Utilized Jupyter Notebooks and visualization libraries for data analysis to consistently deliver accurate and comprehensive research reports with insightful visualizations over the course of a strict 3-month deadline

      Primary Tools:
      • - Python (pandas, numpy, matplotlib)
      • - LaTeX
      • - Google Slides

Research Analyst

CSUN Mobile Computing Lab, Sep '18 - Aug '19

This was my first professional work-experience that was coding-based. The outcome was research that explored the applications of low-cost sensors to be designed as wearable controllers for users with limited mobility.

Key Responsibilites:

  • - Accomplished a first-authored research paper that describes the results of utilizing low-cost wearable sensors to explore technology capable of tracking human head movement
  • - Utilized simple machine learning models to categorize head movement based on real-time data
  • - Demonstrated the capability of microcontrollers accurately tracking and sending data to communicate with machine learning models

      Primary Tools:
      • - Arduino Nano
      • - Raspberry PI
      • - Python (pandas, scikit-learn, numpy)

Published Research Project

Towards Benchmarking Feature Type Inference for AutoML Platforms

Published research project that focuses on utilizing machine learning models to enable ML feature type inference. The work I was fortunate to contribute to the lab heavily focused on gathering and cleaning data from multiple, disparate datasets, and processing these datasets to be used in training machine learning models. Many thanks to UCSD, the STARS research program, and Dr. Arun Kumar and his Data Analytics Lab for the opporutnity to do such work.

Image
Image

Published Research Project

Mosquito Count Automation to Quantify Odor Reception

This was my first first-authored research project (as-in I was the lead in conducting experiments, writing the code, and writing the paper from draft-to- publication). The work outlined in this paper utilizes object detection methods taken from the field of computer vision. We use detection models to help quantify objects-of-interest (here, we count mosquitos) in an effort to support biomedical research.

Interactive EDA Project

USPA Beginners over the past 10 years

This was a fun personal project to explore the athlete data collected by OpenPL. As a brief primer, OpenPL collects data from powerlifting meets, allowing for tracked and up-to-date data on powerlifters and the evergrowing niche community of powerlifting. As a powerlifter myself, I was curious as to how other people start out in the sport and what those numbers have looked like over the past few years.

Image

Hobbies

Image

Powerlifting

I'm a huge fan of powerlifting, and just strength-training in general. I've competed in powerlifting for over 2 years now, and hope to continue for the many years to come!

Image

Travelling

I love travelling! I've been fortunate to travel to different parts of North and Central America, and hope to continue travelling ineternationally and visit other parts of the world.