Dev Blog
Machine learning, anomaly detection, data engineering, and side projects.
Personal notes on machine learning, anomaly detection, data engineering, and side projects.
Posts
51 total
bolt.new vs o1 - some thoughts
llm / machine learning
this is funny - i tried to make a little app to share daily factoids from chatgpt on a site - just as excuse to learn javascript really.
Anomstack - Data Engineering Podcast
anomaly detection / data eng / machine learning
I was recently on the Data Engineering Podcast to chat about anomaly detection and a open source side project im working on - anomstack.
Weights & Biases - log Keras model summary & architecture
machine learning / observability
Maybe i missed something but i could not find any easy and simple out of the box ways to just save Keras [model.summary()](https://github.com/keras-team/keras/blob/v3.0.5/keras/models/model.py#L217...
MLOps Community Podcast!!!
machine learning
What a time to be alive when nerds like us can geek out every week listening to leaders in the field chatting about what their favourite coffee is and all the cool stuff they are working on!
10 Practical ML Use Cases in Observability
anomaly detection / machine learning / observability / time series
Malloy seems petty cool...
data eng
I discovered Malloy recently in this great talk, it seems like a really interesting idea (a higher level abstraction or DSL on top of sql) with some great people behind it (looker founder who seems...
Painless Anomaly Detection with Apache Airflow
airflow / anomaly detection / machine learning
Data observability is so hot right now…but do you know what’s also hot? Using some tried and tested ingredients like Apache Airflow and PyOD to perform painless anomaly detection on your key busine...
Stripe Webhook + GCP Functions Framework (Python)
cloud / eng
This took a couple of days of messing around so decided to make a post out of it.
Colab to just run some curl
cloud
Here is a little google colab notebook to just paste in some curl command and get the response back into a json dictionary.
Some sort-of livecoding
observability
Explaining kmeans clustering for unsupervised anomaly detection
anomaly detection / clustering / machine learning
Here is a video I did in work explaining how our anomaly detection works.
Hugging Face Text Classification Quickstart
machine learning / nlp
I have been working a bit lately with some text classification stuff using Hugging Face - its great n all but their docs can actually be a bit overwhelming.
Airflow "Trigger Dags" Python Script
airflow
You have some dag that runs multiple times a day but you need to do a manual backfill of last 30 days.
Some ML hot takes
anomaly detection / machine learning / observability
Here is a blog post I made in work with some thoughts on the approach to ML that I want us to try take: https://www.netdata.cloud/blog/our-approach-to-machine-learning
Time series anomaly detection using PCA
anomaly detection / machine learning / time series
streamlit multi-page app minimal example
apps / eng
Anomaly Detection using the Matrix Profile
anomaly detection / machine learning / time series
"Machine Learning Ireland" slack community
machine learning
Time series anomaly detection in Go using GoLearn
anomaly detection / golang / machine learning / time series
Anomaly Detection Resources
anomaly detection / machine learning
A list of useful Anomaly Detection resources, as I find interesting material I will add to this list.
First stab at some Go (so hot right now)
golang
It may be a combination of starting to go stir crazy over the Christmas break and some self loathing at the amount of FIFA i’ve been playing that’s driven me to finally start learning some Go for a...
Anomaly Detection Tutorial
anomaly detection / machine learning / time series
Good Places For Datasets
I get asked this a lot by students so decided to make a little list in here that I can add to and point people towards.
Different types of time series anomalies
anomaly detection / machine learning / time series
I find myself having to refer to this enough times that I decided to make a little infographic I can just more easily link to :)
Numpy Feature Engineering - 2x Speed Up Over Pandas!
machine learning / numpy / time series
Market basket analysis in Python
machine learning
<p>An actual market basket I found in my Google photos.</p>
I helped build a thing!
anomaly detection / cloud / eng / machine learning
Time series clustering with tslearn
clustering / machine learning / time series
Premature Optimization
anomaly detection / failure
Terraform is Magic + r/MachineLearning Links
machine learning
A little brainteaser (or i'm an idiot)
machine learning
Papers i'm reading #2
Continuation from this post.
Papers i'm reading #1
anomaly detection / machine learning
I’ve recently set myself the goal of reading one academic paper a week relating to the ML/AI things i’m working on i’m my current role.
My First PyPI Package
I’ve been threatening to myself to do this for a long time and recently got around to it, so as usual i’m going to try milk it for a blog post (Note: i’m not talking about getting into a box like t...
KubeFlow Custom Jupyter Image (+ github for notebook source control)
kubeflow / machine learning
I’ve been playing around a bit with KubeFlow a bit lately and found that a lot of the tutorials and examples of Jupyter notebooks on KubeFlow do a lot of the pip install and other sort of setup and...
Multi-Variate, Multi-Step, LSTM for Anomaly Detection
machine learning
This post will walk through a synthetic example illustrating one way to use a multi-variate, multi-step LSTM for anomaly detection.
Clustering Cell Tower Usage Data
Below is a little mini project i did a while back looking at cell tower usage data.
( 0 - 0 ) / 0 != 0
Arrrgghh - I just wasted the best part of an afternoon chasing this one down. If i can knock out a quick post on it then at least i’ll feel i’ve gotten something out of it.
Java Hello World cli using args4j
java
A simple example of how to use args4j to add command line args to a simple “Hello World” type java application.
Java Weka API: Adding List To Instances Object
java
This is just a quick one to save anyone else new to the Weka api in Java spending as much time as i did figuring this one out.
Parallel Jupyter Notebooks
I have become master of the notebooks, they bend at my will and exist to serve my data science needs!
My Site...
Was doing some blogs on medium but don’t trust them to not all end up behind some sort of paywall so decided to shell out a couple quid a month for my own domain.
Previous blog posts
List of blog posts i’ve done before this site.
Java for Machine Learning
java
I’ve recently been learning java for machine learning related work (a long story to do with mainframes, and you know, why not).