I have been working a bit lately with some text classification stuff using Hugging Face - its great n all but their docs can actually be a bit overwhelming. So here is a minimal text classification example, using huggingface and either pytorch or tensorflow (you decide). Will try to update and maintain the colab here: … Continue reading Hugging Face Text Classification Quickstart
Airflow “Trigger Dags” Python Script
You have some dag that runs multiple times a day but you need to do a manual backfill of last 30 days. It's 2022 and this is still surprisingly painful with Airflow. The "new" REST API helps and mean's all the building blocks are there but, as I found out today, there can often still … Continue reading Airflow “Trigger Dags” Python Script
Some ML hot takes
Here is a blog post I made in work with some thoughts on the approach to ML that I want us to try take: https://www.netdata.cloud/blog/our-approach-to-machine-learning
Time series anomaly detection using PCA
Here is a little recipe for using good old PCA to do some fast and efficient time series anomaly detection.
streamlit multi-page app minimal example
too obvious? maybe. probably. Recently i had a need to assess streamlit for some internal DS/ML/Data apps i wanted to build in my job. By "i had a need" i mean i heard it was the new cool thing so i wanted to play with it and feel better about myself. Anyway, as part of … Continue reading streamlit multi-page app minimal example
Some asyncio fun/pain
Taken from this great Talk Python Training course - get the lifetime bundle if you can! You have a list of api endpoints you want to pull data from and collect results into some results list or dataframe for further processing. You could just loop over that list and make a load of requests.get() calls … Continue reading Some asyncio fun/pain
Anomaly Detection using the Matrix Profile
I like an excuse to play with fancy things, so when i first learned about the Matrix Profile for time series analysis, particularly around anomaly detection, i was intrigued. When i learned there was a nice python package (STUMPY) i could just pip install i was outright excited, as one thing i like more than … Continue reading Anomaly Detection using the Matrix Profile
“Machine Learning Ireland” slack community
as it looks - made in about 20 seconds in google drawing 🙂 I've been thinking for a while that there really should be some go-to online community for people interested in ML and ireland in some way. I was fairly surprised to find that none really existed when i went to find one to … Continue reading “Machine Learning Ireland” slack community
Time series anomaly detection in Go using GoLearn
Output of the Go script. I've posted recently about learning just enough Go to be dangerous over the christmas break, well here is a update on my adventures so far. The below script (which is probably horrible in places if you know Go properly - tips welcome) uses goroutines to pull data from some REST … Continue reading Time series anomaly detection in Go using GoLearn
Anomaly Detection Resources
A list of useful Anomaly Detection resources, as I find interesting material I will add to this list. - "Awesome" time series anomaly detection list of GitHub. - YouTube playlist of interesting anomaly detection videos I maintain. - An interesting r/machinelearning thread. - Public Mendeley group for interesting Anomaly Detection papers. - KDD 2020 Tutorial … Continue reading Anomaly Detection Resources