GitHub Archive: API for GitHub's historical timeline

You folks create a lot of events on GitHub. Trying to mine and report on that data is definitely a big data problem. Ilya Grigorik wants to help the community get a handle on the GitHub firehose with GitHub Archive. With an API call you can get a range of GitHub’s public timeline data for all seventeen event types:

require 'open-uri'
require 'zlib'
require 'yajl'

gz = open('')
js =

Yajl::Parser.parse(js) do |event|
  print event

The source is on GitHub. Be sure and listen to Ilya on episode #55 talking about Goliah, EventMachine, and SPDY.

News Films

Our little film studio focuses on telling developer-centric stories that need to be seen.

Beyond Code: Season 3 / GopherCon 2015

0:00 / 0:00