Java In The Driving Seat

Another weekend spent mostly learned after a pivot to learning Java to create apps. I did get some product output on Sunday evening but not enough to be meaningful.

Read More

Switching Gears

B.A.W went better this past weekend, but it was more learning than building. Upon expert recommendation, I’m switching to Java (a language I hasn’t used since college) from Python to make my app. I also made some progress in my...

Read More

Nothing

B.A.W went badly this week with only a couple of new tees to show for a weekend’s worth of (not much) work.

Read More

Shirts And An Unruly Chat App

Build-A-Weekend (for now and ever called BAW) continued some time on Saturday. I dabbled with the UI of the Kivy app I’m building, but didn’t really get much done.

BAW began early Sunday morning with me spending a couple of...

Read More

Build-A-Weekend Gets Off The Ground

Build-A-Weekend is the term I created for my goal to consistently work on weekend project ideas. The first two weekends after its inception did not go well. This weekend it finally gears up. A friend and I have an app...

Read More

My First Bot

I wanted to make a bot for a long time, and I just got around to it a couple of days ago. My Markov Philosophy Bot relies on the input of five philosophical works into a Markov Chain to produce...

Read More

Data Science Self-Reflection

My time studying Data Science has led to the bookmarking of many resources. I keep a tiny minority close for reference, but most are hidden in deep virtual storage rooms, only snatching brief looks at the light of day when...

Read More

Hate Speech Project

The beauty of the internet comes with its share of ugliness. In recent months, Twitter’s dark side has been in the spotlight. Increasingly its the platform’s bad eggs that seem to define it, not its usefulness for connecting people across...

Read More

ODSC And Me

It’s been a while since my last post. For the last four or so months I’ve been working for the Open Data Science Conference (ODSC). Over that time I’ve written a number of blogs. You can find all of them...

Read More

Blasters and Bayesian Stats

I’m alive! For my first post of the new year I’ll apply some Bayesian Stats to Star Wars. This was originally down in an Jupyter notebook which I’ll just post below.

Who Is The Best Shot In Star Wars?
...
Read More

Week 11

The penultimate week of the bootcamp bought around round of Machine Learning. First we explored Machine Learning in Apache Spark, then we looked at Association Rule Mining and used R to explore Naive Bayes after previously seeing it in Python....

Read More

Week 10

The tenth week of the bootcamp brought the Hadoop ecosystem to the table. During the week we intertwined theory and practice as we started from first principles with Map-Reduce, and then worked our way up to Hadoop, Hive, and Apache...

Read More

Week 9.75

I hope the random Harry Potter reference was spotted.

This ‘week’ was only two days long due to Thanksgiving. The first day involved a case study where we split into different groups, with each group assigned an algorithm or way...

Read More

Week 9

The time of the end is coming. The end in general, but, more immediately, the end of our travels in Machine Learning.

This past week the content tilted more towards Python than R as we covered ground in the former...

Read More

Week 8

In the eight week we started off with a quick discussion of some hard Computer Science concepts in Python. These included sorting algorithms and data structures. After that we moved on to R to talk about all the theory behind...

Read More

Pydata NYC 2015

November 11 is Veteran’s Day in the United States, and that meant a midweek break from the Data Science bootcamp. There was no rest for the weary, though, as I headed off to the Bank of America Center to experience...

Read More

A Tale of Two Hackathons

I just thought I would write a quick note about two hackathons I participated in recently.

About two weeks ago I went to a hackathon hosted by BetaNYC, an organization looking to better New York City through data. It was...

Read More

Week 7

In this week we returned to R and Machine Learning to look at the curse of dimensionality and clustering. We went to the depths of Principal Component Analysis, Lasso and Ridge Regression, and K-means and Hierarchical Clustering. The real ‘Aha!”...

Read More

Week 6

We’re halfway there.

This week we stepped up to another level of Python with a look at numpy, scipy, pandas, matplotlib, and seaborn. Several exercises and a challenging lab served to provide ample practice for utilizing these libraries. At the...

Read More

Week 5

In my last blog post I spoke about my worry that the fifth week of the bootcamp would be uninteresting due to my previous exposure to Python. Sadly, that mostly turned out to be the case. The week consisted mostly...

Read More

Week 4

Week 4 began with Columbus Day. That shortened the week, but some of the cohort – including myself – did come in to do work. The week’s work began in earnest on Tuesday with a dive into Generalized Linear Models...

Read More

Week 3

The bootcamp’s third week started off with a review of Statistics. We then moved on to dealing with missing data by imputation. The discussion on missingness acted as a segway into Machine Learning through the K Nearest Neighbors (KNN) algorithm...

Read More

Week 2

This post is almost a week late and with good reason. The pace moved up several notches in the second week of the bootcamp as we started out with a deep dive into Data Visualization in R geared towards using...

Read More

Week 1

The first week of the NYC Data Science Academy Bootcamp has come and gone. This week we started off with an overview of working with the Linux command line interface, Git and GitHub, SQL, and ended with diving into the...

Read More

Beyond Coding and Webscraping

This post was originally written as an IPython notebook, a link to which I planned to include in this blog post. Then I had the great idea of just using the notebook itself. Turns out converting it a markdown file...

Read More

First Steps in Tableau

Data Visualization is an integral part of Data Science, so it’s without question something to master. Two of the most well known tools for visualization are D3 and Tableau. The first I have to do some Javascript knowledge building in...

Read More

An Update

It’s been a long while since I’ve made a post. Although the blog has been collecting cobwebs, my mind is undergoing constant cleaning and renovation. I’ve spent the last couple of months solidifying previous knowledge, learning new things, and practising...

Read More

An Adapted Tutorial

Recently a prominent blogger in the Soccer Analytics community made a tutorial about using Monte Carlo Simulations for the calculation of a metric called Expected Goals (xG) using R. With my less than positive relationship with R, I decided to...

Read More

So It Begins

I took a while to get around to creating this site. As per the description, it’s my goal to become a Data Scientist. I’m on the path, and I’m making progress. This site will serve as my base before and...

Read More