Querying 10 years of GitHub data with GHTorrent and Libraries.io

There are two fun angles to this article.

  1. The team over at CHAOSSEARCH has built Elasticsearch-like functionality on top of AWS S3 buckets. It looks compelling for anyone who’s managed a large ES cluster and is looking at other ways to get search functionality out of a lot of data.
  2. Exploring the GitHub data surfaces a ton of interesting insights around popular and unpopular licenses, programming languages, and the libraries available to explore them.
