Khuyen Tran

36K Followers

Pinned

About Me

Who I am and What Motivates me to Write

Hello! I’m Khuyen Tran. I have been writing on Medium since December 2019, but I haven’t properly introduced myself, so I wrote this article to do so. I major in statistics, but I love playing with data science and Python tools and sharing them with others in my free time…

4 min read


Published in Towards Data Science · May 26

Stop Hard Coding in a Data Science Project — Use Config Files Instead

How to efficiently interact with config files in Python

Originally published at https://mathdatasimplified.com on May 26, 2023.

Problem

In your data science project, certain values tend to change frequently, such as file names, selected features, train-test split ratio, and hyperparameters for your model.

Python

6 min read


Published in Towards Data Science · Apr 30

What is dbt (data build tool) and When should you use it?

Discover the Hidden Benefits and Drawbacks of dbt

Motivation

If your organization is looking to create a data-driven product, you should consider having efficient data pipelines to:

  • Stay competitive: Quick access to data and its analysis through efficient data pipelines accelerates decision-making and keeps you ahead of your competition.
  • Reduce costs: The time and effort required to collect…

Data Engineering

8 min read


Published in Towards Data Science · Apr 16

Automate Machine Learning Deployment with GitHub Actions

Faster Time to Market and Increase Efficiency

Motivation

Consider this scenario: A more accurate machine learning model is developed every month and added to the main branch. To deploy the model, you must download it to your machine, package it, and deploy it.

Python

8 min read


Published in Towards Data Science · Apr 13

Simplify Data Science Workflows on BigQuery with Fugue and Python

Speed Up Iteration and Cut Computation Cost

Motivation

Many data teams begin by establishing an analytics practice on data warehouses such as BigQuery. However, solely relying on BigQuery for data science workloads may not be the best approach due to various reasons:

  • Advanced needs beyond SQL: Use cases such as data validation, visualization, and machine learning forecasting may…

Python

6 min read


Published in Towards Data Science · Apr 6

Build Reliable Machine Learning Pipelines with Continuous Integration

Automate Machine Learning Workflow with Continuous Integration

Scenario

As a data scientist, you are responsible for improving the model currently in production. After spending months fine-tuning the model, you discover one with greater accuracy than the original. Excited by your breakthrough, you create a pull request to merge your model into the main branch. …

Python

8 min read


Published in Towards Data Science · Mar 11

Write Readable Tests for Your Machine Learning Models with Behave

Use natural language to test the behavior of your ML models

Motivation

Imagine you create an ML model to predict customer sentiment based on reviews. Upon deploying it, you realize that the model incorrectly labels certain positive reviews as negative when they’re rephrased using negative words. This is just one example of how an extremely accurate ML model can fail without…

Python

9 min read


Published in Towards Data Science · Jan 15

How to Structure an ML Project for Reproducibility and Maintainability

Start Your Next ML Project With This Template

Motivation

Getting started is often the most challenging part when building ML projects. How should you structure your repository? Which standards should you follow? Will your teammates be able to reproduce the results of your experimentations? Instead of trying to find an ideal repository structure, wouldn’t it be…

Data Science

7 min read


Published in Towards Data Science · Jan 1

Human-Learn: Rule-Based Learning as an Alternative to Machine Learning

Incorporate Domain Knowledge into Your Model with Rule-Based Learning

Motivation

You are given a labeled dataset and assigned to predict a new one. What would you do? The first approach that you probably try is to train a machine learning model to find rules for labeling new data. This is convenient, but it is challenging to know why the…

Machine Learning

7 min read


Published in Towards Data Science · Dec 21, 2022

Build a Full-Stack ML Application With Pydantic And Prefect

Create a UI for ML Feature Engineering in One Line of Code

Motivation

As a data scientist, you might frequently adjust your feature engineering process and tune your machine learning models to get a good result. Instead of digging into your code to change function parameters: …, wouldn’t it be nice if you could change the parameter values from…

Python

8 min read

MLOps Engineer. Website: https://mathdatasimplified.com

Following
  • Julie Perilla Garcia
  • Martin Heinz
  • Sofien Kaabar, CFA
  • Bex T.
  • Dario Radečić
