Unlocking Innovation: My First Industry Hackathon Adventure with Walmart and Google

26/08/2023 22:15

My first company hackathon, organized by Walmart and Google, was an exciting venture into the world of innovation and technology. Running for four days, from Monday to Thursday, it was a whirlwind of creativity and collaboration.

Unleashing the Power of ChatGPT Language Model: a Journey with DeepLearning.AI course

05/08/2023 22:30

As a data science practitioner eager to harness the potential of artificial intelligence, I embarked on an enlightening journey with renowned AI expert Andrew Ng and OpenAI technical staff member Isa Fulford. Their courses, “ChatGPT Prompt Engineering for Developers” and “Building Systems with the ChatGPT API,” offered on DeepLearning.AI, opened new horizons in the world of large language models (LLMs) such as ChatGPT. Join me as I share my experiences and express my gratitude for the invaluable knowledge gained from these transformative courses.

Differences Between Working on Machine Learning and Software Engineering Projects

11/03/2023 21:48

I have been working in machine learning and data science (ML/DS) engineering in industry for a year. Before that, I worked in software engineering (SE) for two years. Some SE experience carries over to ML/DS, but most of it does not transfer. Here are my personal perspectives.

Distributed Training and Test in Spark XGBoost

04/03/2023 22:30

Recently, we completed a classification project using an XGBoost model. As the data volume keeps growing, we need to replace the current pandas-based XGBoost training in Spark with XGBoost distributed training.

Summary of Predictive Model Performance: Offline and Online Evaluations

20/11/2022 00:30

Here we summarize the paper “Predictive Model Performance: Offline and Online Evaluations” from Microsoft. It analyzes the discrepancy between offline and online metrics, and the simulated metric used to estimate online performance.