I like an excuse to play with fancy things, so when i first learned about the Matrix Profile for time series analysis, particularly around anomaly detection, i was intrigued. When i learned there was a nice python package (STUMPY) i could just pip install i was outright excited, as one thing i like more than … Continue reading Anomaly Detection using the Matrix Profile
Author: andrewm4894
“Machine Learning Ireland” slack community
as it looks - made in about 20 seconds in google drawing 🙂 I've been thinking for a while that there really should be some go-to online community for people interested in ML and ireland in some way. I was fairly surprised to find that none really existed when i went to find one to … Continue reading “Machine Learning Ireland” slack community
Time series anomaly detection in Go using GoLearn
Output of the Go script. I've posted recently about learning just enough Go to be dangerous over the christmas break, well here is a update on my adventures so far. The below script (which is probably horrible in places if you know Go properly - tips welcome) uses goroutines to pull data from some REST … Continue reading Time series anomaly detection in Go using GoLearn
Anomaly Detection Resources
A list of useful Anomaly Detection resources, as I find interesting material I will add to this list. - "Awesome" time series anomaly detection list of GitHub. - YouTube playlist of interesting anomaly detection videos I maintain. - Creator of PyOD "yzhao062/anomaly-detection-resources" list of useful stuff. - An interesting r/machinelearning thread. - Public Mendeley group … Continue reading Anomaly Detection Resources
First stab at some Go (so hot right now)
It may be a combination of starting to go stir crazy over the Christmas break and some self loathing at the amount of FIFA i've been playing that's driven me to finally start learning some Go for a few data science and machine learning related projects i'm working on where it offers unique advantages. (In … Continue reading First stab at some Go (so hot right now)
Anomaly Detection Tutorial
always use a meme to kick off a tutorial Here is an anomaly detection tutorial that i created for my boss and the open source community where i work. It's part of some work i have been doing around adding some anomaly detection functionality into our open source monitoring project. Like most ML projects the … Continue reading Anomaly Detection Tutorial
Good Places For Datasets
I get asked this a lot by students so decided to make a little list in here that I can add to and point people towards. https://github.com/awesomedata/awesome-public-datasetshttps://www.kaggle.com/datasetshttps://datasetsearch.research.google.com/https://cloud.google.com/bigquery/public-datahttps://cloud.google.com/public-datasetshttps://registry.opendata.aws/https://data.world/data
Different types of time series anomalies
I find myself having to refer to this enough times that I decided to make a little infographic I can just more easily link to 🙂
Numpy Feature Engineering – 2x Speed Up Over Pandas!
The Setup This is a little one I was surprised to see. Recently I had a need to do some pretty basic feature engineering to a pandas dataframe prior to training some models. Basically I needed to take differences of each column, apply some smoothing, and then add a number of lagged columns for each … Continue reading Numpy Feature Engineering – 2x Speed Up Over Pandas!
Market basket analysis in Python
An actual market basket I found in my Google photos. tl; dr; if you find yourself doing some association rule mining using mlxtend but finding it a bit slow then checkout PyFIM - here is a colab I made to get you started. I have recently been looking to do some market basket analysis ("Association … Continue reading Market basket analysis in Python