Output of the Go script. I've posted recently about learning just enough Go to be dangerous over the christmas break, well here is a update on my adventures so far. The below script (which is probably horrible in places if you know Go properly - tips welcome) uses goroutines to pull data from some REST … Continue reading Time series anomaly detection in Go using GoLearn
A list of useful Anomaly Detection resources, as I find interesting material I will add to this list. - "Awesome" time series anomaly detection list of GitHub. - YouTube playlist of interesting anomaly detection videos I maintain. - An interesting r/machinelearning thread. - Public Mendeley group for interesting Anomaly Detection papers. - KDD 2020 Tutorial … Continue reading Anomaly detection resources
It may be a combination of starting to go stir crazy over the Christmas break and some self loathing at the amount of FIFA i've been playing that's driven me to finally start learning some Go for a few data science and machine learning related projects i'm working on where it offers unique advantages. (In … Continue reading First stab at some Go (so hot right now)
always use a meme to kick off a tutorial Here is an anomaly detection tutorial that i created for my boss and the open source community where i work. It's part of some work i have been doing around adding some anomaly detection functionality into our open source monitoring project. Like most ML projects the … Continue reading Anomaly Detection Tutorial
I get asked this a lot by students so decided to make a little list in here that I can add to and point people towards. https://github.com/awesomedata/awesome-public-datasetshttps://www.kaggle.com/datasetshttps://datasetsearch.research.google.com/https://cloud.google.com/bigquery/public-datahttps://cloud.google.com/public-datasetshttps://registry.opendata.aws/https://data.world/data
I find myself having to refer to this enough times that I decided to make a little infographic I can just more easily link to 🙂
The Setup This is a little one I was surprised to see. Recently I had a need to do some pretty basic feature engineering to a pandas dataframe prior to training some models. Basically I needed to take differences of each column, apply some smoothing, and then add a number of lagged columns for each … Continue reading Numpy Feature Engineering – 2x Speed Up Over Pandas!
An actual market basket I found in my Google photos. tl; dr; if you find yourself doing some association rule mining using mlxtend but finding it a bit slow then checkout PyFIM - here is a colab I made to get you started. I have recently been looking to do some market basket analysis ("Association … Continue reading Market basket analysis in Python
don't mind if i do Here is a thing i helped build in work that i'm fairly happy with: https://www.linkedin.com/posts/andrewm4894_netdata-introducing-our-first-netdata-cloud-activity-6712008465574887424-SlIr Now, onto the next thing!
I've recently been playing around with some time series clustering tasks and came across the tslearn library. I was interested in seeing how easy it would be to get up and running some of the clustering functionality that is already built into tslearn, turns out it was quite easy and straight forward, perfect blog post … Continue reading Time series clustering with tslearn