A Highly-motivated Data Scientist from Indonesia who is currently into NLP.
Hi, I am Louis' personal AI assistant. Please ask me anything about him with a complete sentence (e.g: "Where does he live now" not "location"), and I'll try my best to answer your queries :)
Intro
Three words for Louis: strong-willed, fast-learner, effective.
Louis Owen is a strong-willed, fast-learner, and effective Data Scientist who is always hungry for new knowledge. He is
currently helping to deliver NLP solutions at Yellow.ai,
a world's leading CX automation platform. He pursued a Mathematics major at one of the top universities in Indonesia,
Institut Teknologi Bandung, under a full final-year scholarship.
Before joining Yellow.ai, Louis was a Data Science Consultant at The World Bank
and AI Research Engineer at Bukalapak.
Throughout his career journey, he worked at various fields of industry: NGO, e-Commerce, Conversational AI, OTA,
Smart City, and FinTech. Please see the Resume section for more information about his experiences.
What they said about Louis
Working in the same team with Louis is awesome and memorable, as stated by his colleagues.
A Life Outside of Work
Louis loves to spent his spare time doing his hobbies:
watching movies, conducting side-projects, and writing articles.
He also loves to give back to the community by sharing his experiences and knowledge.
Finally, Louis loves to meet new friends!
So, please feel free to reach him out if you have any topics to be discussed.
Googling, Python, Git, Cloud Platforms, Tensorflow, Pytorch, Tableau, SQL, R, Docker, Design Thinking, Google Data Studio, Matlab
Working Experience
NLP Engineer - Yellow.ai
Jan 2022 - Present
Improving the NLP system for Indonesian language
AI Research Engineer - Bukalapak
Nov 2020 - Jan 2022
Developed the internal financial metrics anomaly detection system which succeed to prevent 90% of future
financial loss and reduce 20x false alerts compared to the previous implementation
Developed the hierarchical multi-class text classification model (product title & search query) that succeed to
outperform the previous solution by 25% while maintaining low latency
Developed a multi-modal latent embedding generator (NLP & CV) for various downstream tasks, including but
not limited to NSFW detection, search engine enhancement, product scoring, etc
Developed the Robo advisory system that can help users’ investment decision
Initiated and created the centralized code-blocks documentation within the Bukalapak AI team
Actively contributed in several internal talks and external article publications
Data Science Consultant - The World Bank
Apr 2020 - Mar 2021
As a data science short-term consultant for a COVID-19 collaboration project between The World Bank, MIT, Stanford, and Mafindo
Developed a topic monitoring end-to-end pipeline regarding COVID-19 handling in Indonesia based on social media and news media data, which will be presented in a public website and can be used by the policymaker
Automated the broadcast and pre-filled form for the COVID-19 survey
AI Engineer Intern - Qlue Smart City
Feb 2020 - May 2020
Developed an age and gender image classification model using SOTA pre-trained model starting from data gathering until model optimization phase
Developed a mask facial detection model using SOTA pre-trained model starting from data generation until model optimization phase
Created an analysis report about Multiple Object Tracking and Human Action Recognition SOTA methods
Data Scientist Intern - Do-It
Dec 2019 - Jan 2020
Developed a prediction model for Non-Performing-Loan with around 0.85 balanced accuracy using both internal and external data starting from data gathering until deployment phase
Developed a segmentation and cluster prediction model which could increase the manpower efficiency and reduce around 80% of SMS notification cost for recollection phase
Lecturer Assistant - Bandung Institute of Technology
Aug 2019 - Dec 2019
Calculus lecturer assistant for Professor Marcus Wono Setya Budhi Ph.D.
Introduction to Computation lecturer assistant for Fajar Yuliawan, S.T., M.Si.
Data Analyst Intern - Tokopedia
May 2019 - Aug 2019
Developed a complex rule-based segmentation algorithm of Tokopedia Buyer from the logistics side using Python, BigQuery and Tableau
Conducted an analysis of the promo usage by Tokopedia Buyer with RFM framework and decision tree algorithm using Python and Tableau
Created an infographic about Tokopedia Digital performance based on internal and external data
Marketing & Analystics Intern - Traveloka
May 2018 - Aug 2018
Conducted a research about Search Engine Optimization focusing on On-Page Optimization
Conducted a grand audit for Traveloka’s On-Page Optimization
Benchmarked Traveloka’s On-Page Optimization with best practices and competitors
Created a ranking formula model to forecast a potential improvement on some business aspects
Created the operational business process for the future project
publications
L. Owen and F. Oktariani, SENN: Stock Ensemble-based Neural Network for Stock Market Prediction using
Historical Stock Data and Sentiment Analysis, 2020, The 3rd International Conference on Data Science and Its
Applications (ICoDSA).
[Paper ]
[Video]
[Code ]
[PPT]
Education
Bandung Institute of Technology
Bachelor of Science - Mathematics | GPA: 3.92/4.00 (Summa Cumlaude)
Honours and Awards
Scope
Description
Rank
International
Global AI Innovation Challenge 2020 held by Alibaba Cloud
1st Winner
National
Data Analytics Competition FIND IT 2019
held by Gadjah Mada University
1st Winner
International
Hack Asia “Know Your Data, Know Your
Customer” Data Hackathon held by Jardines Matheson
1st Runner Up
National
National Data Science Challenge 2020 helb by Shopee
1st Runner Up
National
HackData Virtual Hackathon 2020 held by Indosat
1st Runner Up
National
BRI Data Hackathon - Cash Ratio Optimization
held by Bank Rakyat Indonesia
Top 5
International
EY NEXT WAVE Data Science Challenge
held by Ernst & Young
Country Finalist
University
Most Outstanding Student in First Year
-
University
Dean’s List Awardee from the First until Last Semester
-
National
Grab Tech Future Leader Awardee: A tech future leader program held by Grab
Indonesia with only around 2% acceptance rate
-
University
Tokopedia Scholarship Awardee: Full tuition fee, monthly allowance, graduation
allowance, and book allowance for 1 year with
only around 4% acceptance rate across Indonesia (the only awardee in his university)
-
Volunteer Experience
Data Science Mentor - Dibimbing.id
May 2021 - Present
Taught 300+ students about: Data Science Portfolio 101, Python Data Types & Structures, Data Cleaning, Data Manipulation,
Exploratory Data Analysis, and Hyperparameter Tuning
Guest Lecturer - Institut Teknologi Bandung
Sep 2021 - Oct 2021
Taught 50+ Mathematics undergraduates on the Lokakarya (Workshop) class about Data Science lifecycle as well as practical Data Science knowledge through hands-on projects
Contributor - AI4Finance
Sep 2020 - Feb 2021
As a contributor for FinRL open-source repo, a repo which provides Deep Reinforcement Learning solutions for Algorithmic Trading task.
The initiation of this project is motivated by a simple question comes to Louis' mind: My portfolio seems too static. What should I do to make it more interactive and fun while also applying my NLP knowledge on it?
A dataset that contains >150k question pairs from First Quora Dataset Release: Question Pairs which marked as duplicates, protected by Terms of Service.
This dataset is translated from English to Bahasa Indonesia using Google Translate API.
The motivation of this project is to enrich the collection of Indonesian NLP corpus
Plagiarism Detection System
A local plagiarism detection system for Bahasa Indonesia.
Responsible for building the whole end-to-end pipeline, starting from the data gathering, training data generation, model training & evaluation, deployment code refactor, visualization, and documentation.
This is a collaboration project with one of Indonesian ed-tech start up, Practisee.
Customizable Real-time Dashboard to Monitor Your Google Form Responses
In this article, Louis gives a detailed step-by-step tutorial on how to visualize real-time Google Form responses in Streamlit dashboard, starting from importing responses in google sheets until the deployment of the dashboard.
The initiation of this project is motivated by a simple question comes to Louis' mind: Is it possible to do text-classification with 150 target classes using only 10 labelled samples for each class but still get a good performance?
Summary of "Neural Network Methods for Natural Language Processing" by Yoav Goldberg and "Deep Learning" by Ian J. Goodfellow
Louis created this book summary for his undergradute thesis and he thought it will be useful for many people so he share this to the LinkedIn Community
In this article, Louis introduced 3 awesome free packages which can generate an HTML report comprises various kind of EDA, in just a couple lines of Python code!
A repo which provides the code to generate instant data report using an R package called Data Explorer which is executed in Python using the help of RPy2
This is a collaboration project between Louis Owen, Vinson Ciawandy and Evan Martua
Given the flight and hotel transaction history data, they try to predict the flight and hotel cross-selling in one of the biggest ticketing company in Indonesia
The initiation of this project is motivated by a simple question comes to Louis' mind: Is it possible to do text-classification with 150 target classes using only 10 labelled samples for each class but still get a good performance?
In this article, Louis shared several simple yet powerful approaches to detect anomaly in time-series data that is not usually discussed in many articles.
In this article, Louis gives a detailed step-by-step tutorial on how to visualize real-time Google Form responses in Streamlit dashboard, starting from importing responses in google sheets until the deployment of the dashboard.
In this article, Louis shared his knowledge on how to create pre-filled Google Forms automatically and how to blast them to your target respondents’ email using Python.
In this article, Louis introduced 3 awesome free packages which can generate an HTML report comprises various kind of EDA, in just a couple lines of Python code!
(Dec 11, 2021) Speaker at Twitter Developer Meetup 2021 (Bangladesh): AI Implementation in Industry. [Delivered in English]
(Dec 4, 2021) Speaker at Gunadharma University Google Data Science Club: Deep Dive into Real World Machine Learning. [Delivered in Bahasa Indonesia]
(Nov 27, 2021) Speaker at Ahmad Dahlan University Data Science Public Lecture: Mathematics and Data Science for a Better Indonesia. [Delivered in Bahasa Indonesia]
(Nov 20, 2021) Speaker at ITB University AI Center of Excellence: How Bukalapak Utilizes Artificial Intelligence. [Delivered in Bahasa Indonesia]
(Oct 07, 2021) Speaker at ICoDSA 2021 Workshop: The Importance of Data Preparation in Industry. [Delivered in English]
(Oct 03, 2021) Speaker at Talent Growth: How to Learn Data Science Effectively. [Delivered in Bahasa Indonesia]
(Oct 02, 2021) Speaker at Studium Generale Universitas Diponegoro: Mathematics in Real World. [Delivered in Bahasa Indonesia]
(Sep 16, 2021) Speaker at Glints Expert Class: How to Create a Marketable Data Science Portfolio. [Delivered in Bahasa Indonesia]
(Sep 04, 2021) Speaker at Datapath: Exploratory Data Analysis in Python. [Delivered in Bahasa Indonesia]
(Aug 29, 2021) Speaker at COMPFEST UI 13: Getting Started with Hyper-parameter Tuning in Machine Learning. [Delivered in Bahasa Indonesia]
(Aug 27, 2021) Speaker at Bitlabs: How to Become Data Scientist with No IT Background. [Delivered in Bahasa Indonesia]
(Aug 14, 2021) Speaker at Miloo: Indonesia Daulat AI "Ekosistem Tangguh, SDM Tumbuh". [Delivered in Bahasa Indonesia]
(Jul 31, 2021) Speaker at Universitas Brawijaya: Tips & Tricks to Kickstart Your Data Science Journey. [Delivered in Bahasa Indonesia]
(Jun 19, 2021) Speaker at She Loves Data: Automating Supervised Machine Learning Pipelines in Python. [Delivered in English]
(May 16, 2021) Speaker at dibimbing.id: Data Science Portfolio 101. [Delivered in Bahasa Indonesia]
(Apr 22, 2021) Speaker at Purwadhika Digital Technology School: Ultimate Career Guides as Data Scientist. [Delivered in Bahasa Indonesia]
(Dec 18, 2020) Speaker at Universitas Pendidikan Indonesia: Tips & Tricks to Kickstart Your Data Science Journey. [Delivered in Bahasa Indonesia]
(Oct 17, 2020) Speaker at DPhi: Automating Supervised Machine Learning Pipeline in Python. [Delivered in English]
(Oct 10, 2020) Speaker at MindHack Summit (India): "Recipes to 'Boost' your Data Science Journey". [Delivered in English]
(Oct 3, 2020) Speaker at PyData Pune (India) Chapter: "Combining the Power of Historical Stock Data, the Wisdom of Crowds, and the Advance of Deep Learning in Stock Market Prediction". [Delivered in English]
(Sep 5, 2020) Speaker at Jakarta Machine Learning: "Stock Market Prediction using Historical Stock Data and Sentiment Analysis". [Delivered in English]
(Sep 4, 2020) Speaker at Quantum.ai (Bangladesh): "Ultimate Guides to Kickstart your Data Science Journey". [Delivered in English]
(Aug 22, 2020) Speaker at FOKUS HIMATIKA ITB 2020: "Tips & Tricks to Kickstart Your Data Science Journey". [Delivered in Bahasa Indonesia]
This is bold and this is strong. This is italic and this is emphasized.
This is superscript text and this is subscript text.
This is underlined and this is code: for (;;) { ... }. Finally, this is a link.
Heading Level 2
Heading Level 3
Heading Level 4
Heading Level 5
Heading Level 6
Blockquote
Fringilla nisl. Donec accumsan interdum nisi, quis tincidunt felis sagittis eget tempus euismod. Vestibulum ante ipsum primis in faucibus vestibulum. Blandit adipiscing eu felis iaculis volutpat ac adipiscing accumsan faucibus. Vestibulum ante ipsum primis in faucibus lorem ipsum dolor sit amet nullam adipiscing eu felis.
Preformatted
i = 0;
while (!deck.isInOrder()) {
print 'Iteration ' + i;
deck.shuffle();
i++;
}
print 'It took ' + i + ' iterations to sort the deck.';