Search results for Commons Claus

The rise of user-hostile software

From Den Delimarsky:

We are truly living in an era of user-hostile software, and when I say “user-hostile” I mean it as “software that doesn’t really care about the needs of the user but rather about the needs of the developer.” And this is not a problem that is bound to a specific operating system (or version thereof) or class or computers. It’s literally cross-platform, and it follows customers from home, to office, to their commute.

Let me give you some examples from my own experience…

A common offender for me are apps that require permissions to “access machines on your local network.”

changelog.com

Holmes - CSS based error detection, watson.

A unique take on HTML ‘validation’, Luke Williams has made finding erroneous markup elementary. It’s a two step process: add the stylesheet to your markup, and add the following class to the <body> tag:

<body class="holmes-debug">

It will catch common errors, like give all <image> tags without an alt attribute a 2 pixel red outline. There are actually three colors, red (error), yellow (warning, potentially bad practice) and dark grey (deprecation). Hovering over most of these gives you a description, so you know what the problem is (or might be).

Checkout the source on GitHub!

changelog.com

multi_xml: Flexible and fast XML parsing in Ruby

An overlooked part of writing a good API wrapper is picking the right parsing library. It’s often a choice between speed and complex platform dependencies. Gems like MultiJSON from Intridea are valuable in providing a common interface to swappable JSON parsers, letting users of your library choose the best fit for their application.

Erik Michaels-Ober took inspiration from MultiJSON and has released MultiXML which aims to bring the swappable parsing approach to XML documents. It looks for the “best” library to use based on parsing speed, looking first for LibXML, then Nokogiri, then Hpricot, and finally REXML. Of course, you can specify your own library of choice.

You can install MultXML via RubyGems:

gem install multi_xml

Then just require it and parse some XML:

require 'multi_xml'

MultiXml.parse('<tag>This is the contents</tag>') # parsed using LibXML if you got it

Or we can specify our library:

MultiXml.engine = :nokogiri
MultiXml.parse('<tag>This is the contents</tag>') # parsed using Nokogiri if you got it

You can set the engine via symbol or class:

MultiXml.engine = :rexml

# or

MultiXML.engine = MultiJson::Engines::Rexml

MultiXML was just released this morning so if you have ideas to add, fork away.

[Source on GitHub]

changelog.com

MooModel: Data Modelling for MooTools

If you’re familiar with Rails and the MVC pattern, then you’d possibly know of the eases of modelling data with an ORM. For a long time, there’s never really been a way to take what we take for granted on the server and do similar within the browser.

Well, not anymore, now there’s a solution that appears to cover all the common issues, it was released earlier this month. Created by Anup Narkhede, MooModel is an ORM and Data Modeller built on top of the MooTools library.

var Post = new Class({
  Extends: MooModel.Base
});

var post = new Post({id: 1, name: "bean", description: "lorem"})
post.save();

Post.find(1); // {id: 1, name: "bean", description: "lorem"}

Currently MooModel isn’t CommonJS compatible and doesn’t work on the MooTools ServerSide project. However, Anup does mention in a blog post about the project that he is working on support for both CommonJS and MooTools Serverside.

The readme for MooModel is fairly light, which doesn’t matter too much, as Anup has written up a quite comprehensive documentation containing the usage of the library on his blog.

[Source on GitHub] [Introduction & Documentation]

JS Party #299

Helping people enter, stay & thrive in tech

Valerie Phoenix from Tech By Choice joins Amal & Kball to tell them all about her non-profit that’s passionate about helping people interested in technology, no matter their experience level.

Matched from the episode's transcript 👇

Valerie Phoenix: Yeah, so I’ve always been really big on doing community work, even through like being really young, in like high school, and middle school, and things like that… I kind of didn’t do that much in college, and when I started making the transition to realize tech is where I was going to go, I wasn’t going to go to grad school, I realized I was missing that. I missed that sense of community, of the shared goal to make things better… And I started to go into more identity-focused tech groups, women in tech groups, and that’s where I found my sweet spot. But there was still something really lacking. I felt like I was still really struggling to break in, despite these groups talking about diversity, about inclusion, about really being a space that I could learn and not struggle and feel accepted in.

It was to the point that I sometimes would say, “Oh, I just won’t pay my phone bill this month, because I want to take this Java Script class. And I can’t afford”, and there was no more scholarships available for me to take the class on my own, so I was constantly making decisions like that. Or if I wanted to go to a meetup group, just to understand and to network I had to make the choice of “Am I going to pay for parking, or am I going to pay for gas?” And just hope I don’t run out on the way there or on the way back. And a lot of the times I opted to pay for gas and hope I didn’t get a parking ticket, that I probably wouldn’t have been able to pay for.

[00:07:54.24] And for my first year and a half I kept having to make those choices, and they kept getting bigger, and the consequences kept expanding. It got to the point where I got a really good apprenticeship program, and I got accepted into it, but the pay I think was $12 an hour, and I was working 20 hours, so I had to cut my full-time job into half, and I took out this really bad loan that had super-high interest rates… And that was my first step into tech, and that was the way that I was doing it. And I still was relying on those communities to provide that extra cushion. There was some support, I did get some scholarships here and there, but that was still my experience with the support.

And so after going through that and after getting my stability in the industry, I realized that we could do more. And I kept pushing for those organizations to do more, and they were just not – I don’t know if they didn’t really understand my experience, or if they did understand that this was common for a lot of people, and that’s why people didn’t make that jump to make the transition into tech even if they were interested… And so that’s where the idea of Tech by Choice came from. I had to make a lot of choices that didn’t benefit me, but in the long run it made a huge difference in what opportunities I had. And so that’s where the organization came from, and why I’m still so passionate about it now.

Changelog Interviews #564

Observing the power of APIs

Jean Yang’s research on programming languages at Carnegie Mellon led her to realize that APIs are the layer that makes or breaks quality software systems. Unfortunately, developers are underserved by tools for dealing with, securing & understanding APIs.

That realization led her to found Akita Software, which led her to join Postman by way of acquisition. That move, at least in part, also led her to join us on this very podcast. We think you’re going to enjoy this interview, we sure did.

Matched from the episode's transcript 👇

Jean Yang: Yeah, and I would say that it’s not even about the future not arriving yet… It’s that some tools are built for a reality that doesn’t exist, and may never exist. And so yeah, how I see it is there’s this notion that everything trickles down from a small set of companies that are doing best practices. And this set of companies tend to be very large, well capitalized, very profitable companies… The Fang, Facebook, Amazon, Apple, Netflix and Google being the models of this is what needs to happen. But it’s not actually trickling down, and not because people are slow to adopt, or because they’re lazy, or they just don’t understand the good solutions… But if you think about it, Google has a set of constraints for their processing like no other company. How many companies actually need to process at the rate of Google in terms of data, in terms of requests, in terms of many other things? Most websites aren’t going to get that many hits in 10 years, what Google gets in a day. And also, there’s other things, like if you’re not set up that way, then it’s not that you don’t have the luxury of having 10 teams to work on, optimizing certain things, or developer productivity… You don’t have the need to do that. And so it’s kind of like – if luxury cars were really lightweight race cars, that were actually dangerous for most people to drive… You know, that’s not a luxury vehicle; that’s just something you don’t need.

So I think that a lot of the influencers talk about – they tell great stories, they tell stuff that would be great for engineers starting out… Any junior engineer learning about how Dropbox did their distributed systems - that’s great education for learning how to do distributed systems better. But most companies don’t have problems of that scale. They don’t need to solve them in the same way. And if they try anything similar, they’re just overbuilding.

[00:44:13.27] So there’s a “common wisdom” among a lot of investors that if you saw it at Facebook, or you saw it at LinkedIn, and you spin it out as a company, it’s going to be successful. I think it’s really worth questioning that, because most companies don’t have problems at that scale; they have problems at a different scale. And so if what you need – so I had a really big realization moment recently, when I was talking with one of my team members, and he had bought a motorcycle. And in my mind I’m like “Oh my God, a motorcycle. So dangerous. Why wouldn’t you get a car?” And he said “I live in Bangalore. You can’t get anywhere with a car, and everyone rides motorcycles. It’s totally different. It’s the only way to get from point A to point B.” And I think there’s a similar reaction sometimes in dev tools, when it’s like “Oh, my God, you haven’t set up this kind of cluster, or you haven’t set it up this way - what are you doing?” But at the level of requests that you actually need to serve to be profitable, and to hit your targets as a company, maybe you don’t need to be doing it that way. And actually doing it that way slows you down, and is impossible.

So I think that even calling these people blue collar workers – I think most developers are not Google. I think people have written a lot of things that have the exact title, “You are not Google, and that’s okay.” But I think we should stop having this idolization of a small set of companies that have problems that no one else actually has. People should stop feeling bad that they’re not solving those problems or having those problems. I think it’s also - side note - a little bit strange that in school we’re teaching people the cutting edge of algorithms… And I think one reason people get really drawn to this is they learn in algorithms class “This is what computer science is”, and then they’re like “Wow, Google is actually applying all of the things they learned in algorithms class to all their problems every day. We should be doing this, too.” But maybe actually there’s other skills that should be taught to you, in side note…

But yeah, software development is a variety of things. Most of it doesn’t look like what people learn in algorithms class, and that’s okay. That’s reality. And it’s not about catching up to the future; this is the present, and the future is going to be more of that. It’s not necessarily writing distributed systems and assembly code that can move at the speed of light.

Practical AI #234

Vector databases (beyond the hype)

There’s so much talk (and hype) these days about vector databases. We thought it would be timely and practical to have someone on the show that has been hands on with the various options and actually tried to build applications leveraging vector search. Prashanth Rao is a real practitioner that has spent and huge amount of time exploring the expanding set of vector database offerings. After introducing vector database and giving us a mental model of how they fit in with other datastores, Prashanth digs into the trade offs as related to indices, hosting options, embedding vs. query optimization, and more.

Matched from the episode's transcript 👇

Prashanth Rao: Absolutely. So I think, before we get into the specifics of databases, I think, to answer Chris’s point, we definitely do need to talk about the evolution, right? I see that vector databases are a natural evolution of the NoSQL class of databases. If you imagine a Venn diagram, you have like a circle that represents SQL, and the other circle represents NoSQL; you have an intersection. That intersection point - I believe they’re called NewSQL now; I’m not sure if you’ve come across that term. It’s quite interesting. But NewSQL - they technically use SQLite languages, but they also claim horizontal scalability, and a bunch of other things related to asset compliance and all the other things. So it marries the benefits of both SQL and NoSQL paradigms. I was thinking initially, “Where do I place vector databases? Does it go in that intersection, or does it sit purely in the NoSQL camp?” Then I imagined this as you extend that circle that has NoSQL, it becomes like a blob, like a fuzzy, amorphous blob. NoSQL is huge, and in my head, vector databases are like an extension to NoSQL. And why they came about - to understand what vectors are, and how they’re stored in a database, I think it’s important to understand what search is, and what essentially you’re doing when you query a NoSQL database.

So where it comes from is, in the early days, I guess people were just submitting an exact query, using a JSON sort of query language, like our MongoDB has… And that query has to have all the terms or parameters in there that tell you what you want to fetch from the database. In a SQL world, it will be done with a declarative query in SQL, whereas in NoSQL, you typically do it in JSON.

Over time, I think the idea of full text search became very important, because I think everyone wants to be able to retrieve information from massive blobs of data sitting around. And how do you query that, right? If it’s in a NoSQL sort of format, you can’t write a SQL query to retrieve it, how do you get that information? So the idea of a full text index came about. And what essentially that is is it uses a concept of inverted indexes - inverted file indexes, sorry - where you consider the term frequencies of terms that appear in a certain document, and obviously, the relative frequency of how often those terms exist in a document, versus the entire dataset.

So you combine all those things together, similar to how [unintelligible 00:13:13.21] is in data science; there’s an algorithm called BM 25, which is the most popular inverted file index algorithm. It’s the most commonly used one for full text search. So the early days of search involved how do you scale that up, because you have massive amounts of data; how you build that index very, very efficiently? And then the query interface sits on top of that, so you essentially submit a query saying, “Okay, so and so done. And the keyword that you put in, and the inverted file index, the BM25 algorithm, it considers the word’s frequency, and it considers subword features, and a bunch of other things to intelligently retrieve relevant documents that contain that term… But also throwing out useless words, stop words, and things like that. So it was more of like a bag of words, sort of… Considering an NLP analogy, it’s kind of like a bag of words way of approaching text.

[14:05] Now, fast-forward a few years, I think ever since the transformer revolution happened, people began observing the obvious power of transformers in encoding semantics. A transformer is way better at isolating meaningful terms in a document, especially when you’re doing things like classification, retrieval, and so on. So how can you merge those benefits of a transformer with what you have in a database?

So I think vector databases - the term got coined, I think, much later, after transformers came about. It was mostly called Search Engines before that, a more generic term, I think a catch-all term for anything that involves search. But nowadays, I believe search engine refers to a more – like, you consider semantics as a key component. So essentially, vectors are the only thing that can do that.

So to really describe what a vector is - essentially, you have a language model, typically a transformer-based language model, that you use to embed the representation of a sentence into tokens, and the representation is stored as a vector. The vector that you have essentially for a particular sentence - typically those are done using sentence transformers, which is the most common kind of model you use. That essentially embeds the entire semantics of that sentence in the vector. And then the way this scales up is you consider the context of each and every token in that vector in a way that when you submit a query, the semantics of the query are mapped to the vector in your database, and you can find a similarity between what you entered as a query, and what exists in the data. So a vector is a very powerful way of, you could say, compressing the representation of meaning in a sentence or a document, in a way that scales up numerically, and you can rapidly query that in [unintelligible 00:15:49.24]

Break: [15:53]

JS Party #276

The ORMazing show

Nick & KBall sit down with the brilliant Stephen Haberman to discuss all things ORMs! 💻🔍

From the advantages and disadvantages of ORMs in general, to delving into the intricacies of his innovative project Joist, which brings a fresh, idiomatic, ActiveRecord-esque approach to TypeScript. 🚀

So sit back, relax, and let’s dive deep into the world of ORMs with the experts!

Matched from the episode's transcript 👇

Stephen Haberman: Yeah, that’s a good question. So I think Joist came around probably circa 2019 or so… So the problem we were solving there at the time was standing up a new tech stack, very stereotypical tech stack at the time, where we had GraphQL on the backend and React on the frontend, and we were using Apollo and Postgres. I love Postgres. And yeah, just trying to find what was the most ergonomic way of standing up our backend. And when looking around for other tools at the time, there’s the class of tools out there, like the Hasuras and the PostGraph files that are super-ergonomic in terms of directly mapping your database schema to your GraphQL public API. Super-amazing. But you know, kind of touched on that business logic thing… We had been wanting to find a way to do like 80% to 90% of like just take your database schema and make your GraphQL API out of that, and do that for the common case… But there’s always this last 10% to 20% where I think like the Hasuras and the PostGraph files, you can start to miss out on “Well, I don’t want my GraphQL API to exactly be my database schema. And so that ruled out those… And yeah, just from my past, I’ve done enough or ORMs, or I had used ORMs, kind of like those. I had used TypeORM in the past, and I was just looking to use something else.

[06:37] But we started out with MicroORM, which is actually really great. I still like it. It matched – and you can tell, like probably 60% to 70% of Joist’s API matches Micro, because our codebase was on Micro for probably six to nine months before we flipped over to Joist.

And the big reason for moving away from Micro - I mean, there was nothing wrong with it, but we were very much in the GraphQL environment, where it’s so easy to do N+1s. And so we were really looking for an ORM that would build in data loading, the whole Facebook Data Loader pattern of you wait until the next event tick to kind of see whatever happened, and then at the end of the event tick you’re like “Oh, you asked for 10 authors in this one event tick, instead of 10 SQL calls. I’m just gonna do one SQL call for all 10 authors, with a ‘Where in’.” And I actually had a pull request into Micro to kind of start to do that… And it worked, but Micro was just mature enough at the time; it was probably already – I really haven’t kept up on it. I liked the Micro author. He was great to work with. But it was probably v – I’m gonna make up v3 or v4… I don’t know, it was a little while ago. But it was already pretty a mature codebase. So it just wasn’t as easy to wander in and like put data loader into the guts of the ORM after it had already been established, and that sort of thing.

And so really, that was it. That, and then the other thing that – from my days working on TypeORM. And again, I haven’t worked with TypeORM for five years at this point… But the biggest frustration I remember, with disclaimers that I have no idea what it looks like these days… But it was that it was incredibly opaque whether your collections were loaded or not; or even any relation. So you might go get an author, and so – oh, to go back to one of the things that ORMs are really good at, is like lazy-loading parts of your object graph as your business logic needs them. So you might start at an author, your endpoint is like “Do something with an author”, and so you get the author, and then you do some business logic, and you’re like “Oh, I need the books.” And then you do some business logic, and you – oh, you need some book reviews. And so ORMs are really good about kind of making it ergonomic to load more and more of your little subgraph as you go. But the trade-off is that you start out with it not loaded. So you start out with an author, and you don’t have the books yet, and you don’t have the book reviews. And my recollection of TypeORM was that it didn’t have a way of representing these two states in the system. And I’m trying to remember, I think you could do things like for the author and the books, tell it that the books is always loaded. But that would mean like every time you touched an author, you de-facto brought back the books. But then at least in the type system you were guaranteed for the books to have been loaded… But you rarely want that, precisely because it’s kind of lazy-loading and the object model is de facto what ORMs are good at.

And so tangenting back to why did Joist come around… So the kind of two a-ha’s that kicked off Joist was “I want to build a data loader from day one, for every single lazy-loaded call.” And the other one was figuring out a way in the type system to represent the two states of a collection that’s not loaded until I ask for it to be loaded with a populate hint, or load hint, which - the load hints and populate hints are not novel to Joist; they go back to Active Record, and I’m sure other ones before, where you start with an author, but before you go in and play with the guts of the author, you say “By the way, I know I’m gonna want the books and the book reviews loaded”, so you give a little hint upfront, to like “Please, go get those for me”, and then your business logic after that can have that.

[10:02] So you’ve always had to do that with ORMs, like the Active Records and TypeORMs of the world. What was novel with Joist about the time was that transition changed the types. So you would start with the author, and you couldn’t do books.get; you would have to do books.load, which was a promise, and then for every book you’d have to do mybook.review set load, and that would be a promise… But if you did – so it’s by default safe, which is one of the things I don’t think I liked about TypeORM, was like by default… Like, the collections would look like you could call get, but it would turn into a runtime exception if you hadn’t made extra-sure that you had done a populate hint 10 or 20 lines up, or we’re even a completely separate method. That’s where I think this can really break down. I remember – even Active Record still kind of has problems with this, where you might have an endpoint that kicks off, and like loads an author in Active Record, and then eventually, you get into abstractions; you call this helper method, and this helper method, and at some point, one of these helper methods is going to need data that you didn’t remember to populate, not only 10 or 20 lines up, but way off in some other method, in your endpoint method.

Yeah, so anyway… So with Joist, just TypeScript map types are just so neat. So I’ve been playing around with a prototype of like “Can I have a domain model that is inherently unloaded, and when overlay this type hint of like “Please ask the ORM to go load the data”, both go load the data from the SQL database, but mark in the text system that all of these are now loaded, and you can do gets.

So I went on super-long tangents, but those are the two… You know, once I had those figured out, both of those seemed novel enough to like “Okay, okay, now it’s worth taking what had been musings, and turning them into an actual project”, and that’s what kicked off Joist.

Practical AI #221

Large models on CPUs

Model sizes are crazy these days with billions and billions of parameters. As Mark Kurtz explains in this episode, this makes inference slow and expensive despite the fact that up to 90%+ of the parameters don’t influence the outputs at all.

Mark helps us understand all of the practicalities and progress that is being made in model optimization and CPU inference, including the increasing opportunities to run LLMs and other Generative AI models on commodity hardware.

Matched from the episode's transcript 👇

Mark Kurtz: Yeah, definitely. And I’ll take two steps to doing that. One is just covering kind of the 90%, 95% class, at least where we’ve been able to get to on those, and the second is looking specifically at large language models. So for the first one, whenever we’re looking at getting rid of 95% of weights - let’s take ResNet-50 as an example. This is our toy benchmark model, this is essentially we’ll hit – we prove out all of our technology on, because it’s a common feature in MLPerf, and for most performance to us.

So what we can do coming in is looking at those convolutional layers; it has - I forget how many million parameters with that, but it’s definitely not the 3 billion, 7 billion or up on top of that. But within that, we can actually zero out; so what we’re doing is taking all – imagine taking all those parameters and dumping them into a giant array, and we’re just going to zero out the ones that are not important. And figuring out the ones that are not important is part of the research. The easiest assumption is just saying that the weights that are the largest are the ones that you want to keep. So the ones that are furthest from zero are the ones that you want to keep. Generally, you can think of those in two ways. One is that as the model’s training and being regularized, the weights that don’t matter are going to move toward zero. And then the other thing is during that forwards pass, the weights that are higher magnitude have more of an effect on the output. And everything else is going to be noise in between.

So we’re able to essentially get rid of just – and whenever I say “get rid of”, I mean setting those parameters to zero within 95% of them. So you’re left with 5% of your weights that are nonzero. And that’s actually all that you need to preserve the accuracy on ImageNet for ResNet-50, for example.

And some quick kind of intuition in terms of how I’ve been able to think about this, and why it works, and things like that, you can see as we increase the size of our dimensionality in our optimization space, what we’re doing is - and there’s a few research papers out on it - that we’re able to connect more of the local mins. So the optimization process will slowly converge further and further down, because more of the local mins are connected. Generally though, there’s only a few of those pathways that you actually need to connect those local mins; so all that we’re doing is we’re following down that most optimized pathway and removing everything else around us in terms of that dimensionality.

So it’s kind of one of those things that as you’re training, it’s slowly selecting the weights that matter, that get you down to that local min. And there’s very few – so the important part was that large dimensionality of the optimization space, but not every direction mattered, right? So then we can get rid of it.

[14:01] And then diving in on the LLM side, large language models - we actually have a recent paper that came out from one of our principal research scientists, Dan Alistarh, called SparseGPT. And that’s where we’re looking at taking OPT and BLOOM models all the way up to 175 billion parameters, and being able to optimize those and remove as many weights as possible, all in this case in one shot. So just using the model without any retraining, we’re able to get rid of around 60% of the weights without doing anything. And there’s a new paper out of Cerebrus, actually, that was looking at the LLM story, and they’re able now to get to 80% sparsity on these LLMs with retraining.

So that’s kind of the research direction that we’re headed down now, is proving out how optimized we can make these models. Because there’s also a lot of interesting stuff that happens with the large language models, specifically because it’s generating one token at a time, very latency-bound, and that means that it’s a lot of memory excess to load those weights. So if you can quantize those and then get rid of half of them, you’re already at anywhere from a 4x to 6x speed-up just on your inference times. And that’s generally where we’re focused and looking at currently, to try and get those LLMs to run faster.

The other thing to call out for those too is 7 billion parameters, and 175 billion parameters - those don’t fit in a single GPU. So now you have clusters of GPUs to serve one model. And a lot of that compute is just completely wasted, because all that it’s going to is trying to maximize the memory on the GPUs. For CPUs, you can throw a few terabytes on there and it works out fine. So that’s the other thing I’d call out with the LLM in terms of GPU versus CPU.

Break: [15:52]

Changelog Interviews #522

The principles of data-oriented programming

Jerod is joined by Yehonathan Sharvit, author of Data-Oriented Programming, to discuss the virtues of treating data as a first-class citizen in our applications and the four principles that make it possible.

Matched from the episode's transcript 👇

Yehonathan Sharvit: Because almost every developer that has worked in a production-ready object-oriented system has suffered from huge class hierarchies, and you inherit from something that inherited from something, and when you want to make a little change, you influence so many things that it’s a nightmare of complexity. And also for code reuse. If you have a method of a class that does - I don’t know, calculates the full name of a user by concatenating first name and last name, if you want to use this piece of code for calculating the name of an author, which happens to also have a first name and a last name, you need to have author and user inherit from a common object, that you call person, or that you call human being, or that you don’t know how to call it exactly, and sometimes you can do it, and sometimes you need multiple inheritance… While the only thing that you need is the ability to call a piece of code. And you cannot really do that OOP in a simple way. There are tricks and design patterns etc, but in the most straightforward way, code is kind of in jail inside the objects that wrap it. And we want freedom. We have a political agenda; we want to free the world. And we don’t want code to be in jail.

Changelog News #7

Chapters, PiBox, using one big server, oncall compensation, being swamped is normal, Tabler & Gum

We add episode chapters to the website, KubeSail sells a PiBox, Nima Badizadegan wants you to use one big server, Gergeloy Orosz details oncall compensation across the software industry, Greg Kogan isn’t impressed with how swamped you are at work, a dashboard template built on Bootstrap & Charm releases a CLI tool for shell scripts.

Matched from the episode's transcript 👇

Jerod Santo: I tried, I really tried. Nima writes on the advantages of using one big server over a distributed system architecture. Nima says, “We have all gotten so familiar with virtualization and abstractions between our software and the servers that run it. These days, “serverless” computing is all the rage, and even “bare metal” is a class of virtual machine. However, every piece of software runs on a server. Since we now live in a world of virtualization, most of these servers are a lot bigger and a lot cheaper than we actually think.”

The article covers the capabilities of one server, what it’ll cost you, how scaling up is easier than scaling out, and rebuts a bunch of common objections to the One Big Server approach.

JS Party #199

Ship less JavaScript, closer to the user

KBall catches up with Chris Ferdinandi about the trends in modern web development towards smaller libraries, pre-compilation, and applications at the edge.

Matched from the episode's transcript 👇

Chris Ferdinandi: And we saw that with ES5, where jQuery for years was the way you build things with JavaScript on the web, because it made stuff that used to be really hard, like selecting an element by something other than an ID, or adding and removing classes, really easy. And then we got querySelector() and querySelectorAll(), and the Class List API. Using an embedded video and web page used to require really complex JavaScript libraries. Now we have the video element.

There’s just all this really cool stuff that’s getting pulled into the platform, and I’m seeing more and more of that starting to happen. I don’t think we’re anywhere near where we need to be, but a lot of these tools that I hate have a really important role of paving the cow paths and showing what we could do. And then that stuff hopefully eventually gets pulled into the standards process and moves into the platform, and that’s where the complexity lives. And it becomes easier for you, the developer, it becomes better for the end user, because it’s already built right in…

I’m even seeing this now, there’s been talk of interactive components, like tabs and carousels and accordions, having native elements, so that you don’t have to roll your own or grab a library every time you want to add these very common user patterns into your interfaces. And so that for me is, I think, the trend that I’m most excited about, but I think it’s also the longest way out. And so I’m excited, but I’m also like, “Okay, you need to be really patient, because this is not going to happen anywhere near overnight.”

Go Time #197

Books that teach Go

Natalie sits down with Go book authors Bill Kennedy & Sau Sheong Chang to discuss the ins and outs of writing (and reading) books about Go!

Matched from the episode's transcript 👇

Natalie Pistunovich: You both mentioned that you agree that there’s a lot of general Go content out there, and we should write, as a community, books about specific things like the standard library.

I want to take this one step back and ask about the – still the idea of teaching programming with books. So I guess I started in university computers, we had lots of different books, so it’s kind of usual for me. But friends of mine who went to bootcamps to learn programming, because their original profession is something unrelated - it’s less common to learn, for example, in the bootcamps that I know of, with books. You learn more with interactive content, and a little bit in-person class, or Zoom calls, and a little bit just programming exercises with the computer. And in some way, it’s similar to how your job is going to be, in the sense that you will sit in front of a computer and will be writing code. And it’s interesting, the idea of teaching something as interactive as programming with a book. Do you think this is something that will stay as we move to more content online, in the variations of recordings or live trainings, and TikToks maybe…?

JS Party #175

This is ReScript

Ever wanted a language like JavaScript, but without the warts, with a great type system, and with a lean build toolchain that doesn’t waste your time?

Patrick Ecker from the ReScript Association sits down with Jerod and Feross to tell us all about this “JavaScript-like language you have been waiting for”.

Matched from the episode's transcript 👇

Patrick Ecker: Yeah, so the most important thing is we have everything on our website, ReScriptlang.org. We’ve hopefully structured the documentation in a way that it’s easy to follow. We usually recommend to start out with the manual, which is more written in a narrative style. So you start at the introduction, and you go through the installation, and then you can dive through the most basic features.

We didn’t mention the more advanced features yet on the manual, because we thought it would be easier for most users to just use the most basic stuff, and they can build all kinds of React apps with that. After you get some idea of the language, it is very important, because a lot of people are coming from a JavaScript background. They want to interact with existing JavaScript libraries. We’ve got a section there for interoperability for the external bindings I was talking about… So you need to get familiarized with “How do I bind to an ES6 module? How do I bind a value to a function that is exported from a CommonJS module, maybe the default export?” Or how do I map to a JavaScript class there? This is probably the first thing you should definitely check out, and at least try… Because a lot of people have this urge to jump into the ecosystem and look for existing ReScript bindings that bind to some JavaScript library, but this takes away a lot of learning possibilities. If you’re in control of writing your own bindings to externals, it’s much easier to do stuff, generally speaking.

As soon as you’ve got this, we also have a ReScript React documentation section, which covers actually all the basic concepts of React. We tried our best to also cover all the React topics, because oftentimes we just refer – like, some people refer users to the original resource, so “Go to reactjs.org, learn React, and then come back to our resource and learn ReScript React.” We don’t do that. So you can go from topic to topic there. How do I use JSX? How do I create elements? How do I mix them up with existing components and how do I export it to JavaScript so I can use it in JavaScript? From there on, it is usually recommended to just drop into your React codebase, run npm install ReScript in your dev dependencies, and set up a config file with Npx ReScript in it, and just point to the folder with your components. Then create your first ReScript file, write your first component, try it out, and wire it up in your existing JavaScript app, and that’s it. From there on, you can just play around with all the features and see if you like it.

JS Party #146

Redux is definitely NOT dead

Redux maintainer Mark Erikson joins Jerod and Amal for an in-depth conversation around the React community’s fav state management solution. We learn how Mark came to be maintainer of Redux, why and how Redux Toolkit came about, when to go with Redux vs other options, and much more.

ALSO: prop drilling, the grep factor, & lasagna mode (oh my)

Matched from the episode's transcript 👇

Mark Erikson: Sure. So after writing the Redux FAQ in the spring of 2016, I followed that with a recipes section called Structuring Reducers, which gives some guidelines on things like “Why do we split up reducer logic into multiple functions? What are some ways that you can organize that reducer logic? And one of the patterns that I’d seen being used just in the first year of Redux’s existence was this idea of normalizing your state, which generally has two aspects of it. One is that you don’t wanna have duplicate copies of data being kept in the store.

[35:59] If we go back to that blogging example - so we’ve got users, posts and comments - every post probably has the user who created it. And if we fetch that data from the server, every post object might have a separate copy of the users object nested inside. We don’t want to store 50 copies of the user object in the Redux state, we just want to have one copy of the object per each user. And there’s a lot of cases where we want to be able to find a given user, a given post, a given comment by their ID.

So normalizing state generally implies that you’re going to store things as a look-up table, where the keys are the IDs, and the values are the items themselves, rather than storing them as an array. And so I wrote a docs page called “Normalizing state shape” that describes what this pattern is, and specifically suggest it as a good idea.

Despite that, we never included anything in the Redux library itself that ever helped you with the process of normalizing state in any way. There was a very popular library that’s been used with Redux called Normalizer, which I think Dan either started or helped maintain for a while. There’s also a library called Redux-ORM, which provides a class model-like facade over the plain data in your Redux store; and I did use that on one of my projects. But there was nothing built into the core library itself.

So after we’d built out the initial APIs for Redux Toolkit, earlier this year I was starting to think about that idea of normalization as a problem space, that we ought to supply something to help with. So I was looking over various packages and third-party libraries other people created that help with that in some way, and I ran across something that the NgRx store people had created. NgRx is basically a reimplementation of Redux for the Angular ecosystem, built around the RxJs package. And because of that, there’s a lot of overlap in the kinds of things that both Redux and NgRx do.

The NgRx maintainers had created an add-on called createEntityAdapter, which basically it provides a set of pre-built reducer functions for things like add one, add many, set all, upsert one, remove all etc. The typical CRUD-type operations you would do on a set of data. And I looked at it, I’m like “You know what - this package only has one or two references to NgRx at all. It’s almost library-agnostic. Is there any way we could make this reusable, so we could start using it with Redux Toolkit?” And they started looking at it, and I started playing around with it myself, and I ended up actually kind of porting it over and half-rewriting it. I added the use of Immer inside, the arguments for their functions were in the update,state order, instate of state,update… So I switched them around, so we could actually used them as reducer functions.

So ultimately, I ended up porting it, but none of that would have existed if the NgRx folks hadn’t created it in the first place… And it’s really cool to see that cross-pollination of ideas going back and forth, because NgRx was inspired by Redux, our createEntityAdapter, was a port of theirs… So it allows you to skip having to write reducer logic in a lot of cases for the most common kinds of update scenarios that you might be dealing with when dealing with a collection of some items. And you can either use them as the entire reducer function for a given action type, or you can use them as helpers within a larger reducer function as part of the logic that you’re writing.

JS Party #139

Best practices for Node developers

Node.js development began a bit like the Wild West, but over time idioms, anti-patterns, and best practices have emerged. Yoni Goldberg’s Node Best Practices repo on GitHub collects, documents, and explains the best practices for Node developers. On this episode, Yoni joins us to discuss.

Matched from the episode's transcript 👇

Yoni Goldberg: Yeah… So I guess that malicious or challenging input is a big topic, but first - I do agree with you about TypeScript, but I would phrase it in a more generic way. This was very important advice in Node.js land three years ago; now it’s almost common wisdom - you should have some mechanism of presenting the schema that you’re expecting in the request… Whether it’s a TypeScript or a JSON schema, or a class validator becomes now very popular… [unintelligible 01:00:36.18] something that should limit the attack surface and the – you know, it’s funny… I think that I tried many, many times - at least until a year or two ago - to do a very simple attack against a Node.js application. You know that there is some post route - just send a custom JSON, with other property. And a big portion of the application just crashed after this, because someone was doing that – I expect some property dot other property, but it wasn’t there, so there was an uncaught error. So yeah, validation is obviously one tier, and our linters that will catch (think like) SQL injections, or an [unintelligible 01:01:18.19]

One of the interesting stuff is [unintelligible 01:01:22.18] get dynamic input from the user. So it’s not a structured request, with a specific JSON schema; it’s kind of free-form content user [unintelligible 01:01:34.03] How do you treat that escaping of the string? Should it happen on the first tier before saving to the database, should it happen on the upstream, when we send back to the user? One of the things that we learned is that this type of escaping should happen on the upstream; in other words, when you return it back to the device user that is querying for the information. Because escaping is a platform-specific thing. You escape differently for browsers, you escape differently for mobile applications, for some platforms you don’t know how to escape… So typically, you should save in the database the raw information - obviously, after you have ensured there’s no SQL injection there; and if you used the right ORMs, or tools, wrappers, there shouldn’t be an SQL injection. But generally speaking, the raw content should be stored in the database, and the sanitizing/escaping should happen as the content is served back to the user.

Practical AI #99

Attack of the C̶l̶o̶n̶e̶s̶ Text!

Come hang with the bad boys of natural language processing (NLP)! Jack Morris joins Daniel and Chris to talk about TextAttack, a Python framework for adversarial attacks, data augmentation, and model training in NLP. TextAttack will improve your understanding of your NLP models, so come prepared to rumble with your own adversarial attacks!

Matched from the episode's transcript 👇

Jack Morris: Yeah, absolutely. It might help for me to talk real quickly about that systemized [unintelligible 00:32:25.03] the components, and then I can explain the most common use cases… Because obviously, you can pull out any one of the components and use them for your own purposes. So one thing that we really focused on in TextAttack is trying to make it work out of the box. For example, those counterfitted word embeddings, instead of going to this website, downloading it, unzipping it, moving it, finding out how to load all the data, you just import TextAttack and do “textattack.the-class” and just initialize it and it will download everything for you… Which I think is really cool.

If you guys know about Hugging Face Transformers - a lot of the TextAttack stuff is built around transformers and tokenizers, and now this dataset loading library called nlp, which I’m very grateful for… We kind of tried to follow the same model. So instead of having all these files you manipulate yourself, you pretty much just reuse other people’s, and it saves a lot of time.

The easiest or probably most common way that I would imagine people use TextAttack down the line is for things like that, for embeddings. Or another very common thing is sentence encodings, which is something I mentioned at the beginning of this talk. There’s so many different methods for taking a sentence and encoding it into a fixed-length vector; whether they’re very effective or not is a question, but they’re useful in a lot of situations…

So one thing TextAttack has done is just sort of abstracted them into classes that work by themselves, so you could just – for example, if you were doing some project… I don’t know, you wanted to look at a bunch of Airbnb reviews and cluster them based on which ones were similar, you could just import TextAttack and then just call [unintelligible 00:34:13.05] and then give it the list, and it would just do it for you, which I think is pretty valuable.

I’ll tell you what the components are very quickly. There’s four, and we have our own names for them, which I think increases the learning curve a little bit… But there’s some benefits, I think, to having around terminology. So it’s all based around this idea of the NLP attack as a system, which is taking the text input, looking for changes you can make to it, making sure those changes are acceptable, and then whenever you have decided you fool the model, you stop.

The first component would be what we call the transformation, which is taking an input and changing some of the words or characters. One transformation would be substituting words with their counterfitted word embedding neighbors. Then once you do that transformation step, there’s also this idea of a constraint, which is trying to make sure you didn’t make any mistakes.

A common constraint is use a sentence encoder. A popular one is called the Universal Sentence Encoder, which is by some folks at Google… And you encode the original input and now your potential adversarial, and make sure that the sentence encoder also says they’re very similar. It’s basically like a sanity check to make sure you didn’t change the meaning, or change too many characters, if that’s what you decide…

And then there’s two other components. So we had the transformation and the constraints… And you have to define your notion of whether you fooled the model or not. A common thing would just be change the classification output, or change the classification output to a specific class. Those would both be examples of what we call the goal function.

I think a really cool one that I wanna explore more in the future is with sequence to sequence models, like a machine translation model. Your goal might be to take the original output translation and change as many characters as possible.

[36:14] Say you’re translating a sentence into French; you would have your original translation, and if you could substitute a word from the input with a synonym, and then it produced a translation that was totally different, even just in terms of characters, or its Blue score, that would be pretty telling, and probably very bad for your translation system… So that would be another goal function, would be trying to minimize the Blue score.

And then the last component is called the Search method. That’s basically like if you have the input and you have all these transformations, how do you decide which one to keep? Which is important, because if you just tried all the combinations – I mean, if you have an input of ten words and each word has 50 neighbors, you end up with 50 times 50 times 50 possible substitutions that you might wanna combine… So the space grows exponentially very quickly, so you have to come up with some sort of greedy, or approximate heuristics for doing that. That’s what we call a Search method.

So you can combine those four things into an attack, in NLP what we call an attack, which is just a search for adversarial examples that meet the constraints and fool the model as defined by the goal function. But there’s some really cool other things that come off of that. A big one that I’ve been talking to people about recently is data augmentation, which is also a very under-researched field in NLP; it’s another thing that is pretty commonplace in vision, and almost everyone does it… You know, if you wanna train a state of the art vision model on CIFAR-10, or ImageNet, or some other dataset, you’re gonna do some sort of augmentation to change and increase the size of your dataset.

With TextAttack, if you have this transformation which can find maybe semantics-preserving changes to your input, and you could add on constraints, which make sure that they preserve semantics, then you can end up with some pretty good tools for data augmentation, just from those two things. And since we’re trying to implement more components, that would hopefully grow the list of potential augmentation modules as well. So yeah, that’s something I’m really excited about, just data augmentation.

Go Time #134

Beginnings

Mat Ryer talks to a new full-time Go programmer, an intern at Google, and a high-school programmer about the tech world from their perspective.

Matched from the episode's transcript 👇

Shaquille Que: For me it’s goroutines and channels, and sort of the idiomatic way to use concurrency in Go. I think the patterns that they want you to use for concurrency is very good in terms of how you can avoid a lot of the common pitfalls with parallel programming… In particular, I just finished a class last semester on parallel programming and different patterns, and I find myself translating a lot of those patterns into how they would work in Go, and thinking “Hm. That gives me a better model of how this pattern really works.” And kind of contrasting it with Go, I can see why people encourage you to use channels, rather than for example giving new text everywhere.

Changelog Interviews #389

Securing the web with Let's Encrypt

We’re talking with Josh Aas, the Executive Director of the Internet Security Research Group, which is the legal entity behind the Let’s Encrypt certificate authority. In June of 2017, Let’s Encrypt celebrated 100 Million certificates issued. Now, just about 2.5 years later, that number has grown to 1 Billion and 200 Million websites served. We talk with Josh about his journey and what it’s taken to build and grow Let’s Encrypt to enable a secure by default internet for everyone.

Matched from the episode's transcript 👇

Josh Aas: Yeah. The next step is we need to rewrite all the software that we already wrote in C and C++, and replace it. And when I tell people that, the most common reaction is like “You can’t possibly expect us to rewrite the world. That’s so unreasonable. You’re not a realistic person when you say that.” And you know, I really strongly object to that reaction. We’re in a world full of talented people who care, and we can absolutely accomplish that if we want to.

If your goal is to rewrite a major web server or a major proxy server, or a major library or whatever, in Rust - let’s just do it. Yeah, it’ll take five years, it’ll introduce some logic bugs along the way that will get fixed, but in the end, this software is gonna be around for a very long time. And we need to eliminate that massive class of bugs, because vulnerability scanning, and audits, and static analysis, pentesting - that stuff doesn’t even begin to deal with the problem. It’s a good thing to do if you’re stuck with C and C++, but it’s absolutely not gonna eliminate the bugs. That’s just not gonna go away until you rewrite it.

[01:16:05.22] What we’re doing right now, where we just spin up giant piles of C and C++ without thinking about it is – we should not be doing that. We can’t be doing that 10-20 years from now if we wanna try to have a more secure world than we have now. So I think we need to think bigger. We just need to think like “Yeah, let’s rewrite the world.” Rewriting a big web server is a big project, but I’m sure there are teams at any number of companies that could accomplish it on their own without a help, if they just decide to do it. Yeah, it’ll be five years, but whatever; five years from now, you put in some effort, and now you’ve got a much more secure software system.

So I’d like to just see some more ambition and some more optimistic thinking about this stuff. I think it’s really important. I don’t wanna be suffering from buffer overflows in everyday software that sits on the network edge 10-20 years from now.

Changelog Interviews #388

The 10x developer myth

In late 2019, Bill Nichols, a senior member of the technical staff at Carnegie Mellon University with the Software Engineering Institute published his study on “the 10x developer myth.” On this show we talk with Bill about all the details of his research. Is the 10x developer a myth? Let’s find out.

Matched from the episode's transcript 👇

Jerod Santo: …so maybe it’s very common and I’m out of the loop, maybe it’s old-fashioned, I don’t know… But tell us about that, and the kind of people that were in the class.

Go Time #112

defer GoTime()

Mat, Carmen, and Jon are joined by Dan Scales to talk about Mat’s favorite keyword in Go - defer. Where did the defer statement come from? What problems can it solve? How has it shaped how we write Go code? How are other languages solving similar problems? And what exactly was changed in Go 1.14 to improve the performance of defer?

Matched from the episode's transcript 👇

Dan Scales: Yes, exactly. So you may allocate just a normal object, for instance, and it has a constructor, and you declare the variable the beginning of the block, and if that class of that variable has a constructor, you run the constructor at the time that you enter the block. And then C++ guarantees that you will run the destructor at the end of the block, and that may deallocate sub-objects or whatnot… The main thing is it guarantees it no matter what, whether you return early from the function out of the block, or also, again, like defer, if you’re panicking. And that’s especially important, just like defer, if you’re holding on to a resource, which is the common case, whether it’s a lock or a file.

In C++ one of the acronyms that’s used that came from Bjarne Stroustrup is “Resource Acquisition Is Initialization”, which is called RAII… But in any case, he’s basically just saying that you can express acquiring a resource, and then guaranteeing that you’re gonna release it at the end of the block by initializing a variable. So what people do is, for instance, they might have a class which is basically a lock, and they acquire it at the beginning of the block, and then just by exiting the block, the lock is released.

[12:12] All that was kind of a description to say, well, C++, and especially GCC, has made that overhead basically zero. They do the right thing; they generate code at the end of the block, that just calls the unlock call. So it’s a very little overhead for that. And then they do the extra work to make sure it happens at panic time. If we can get closer to that all the time, then people don’t have to think about it for defer as well.

Practical AI #47

GANs, RL, and transfer learning oh my!

Daniel and Chris explore three potentially confusing topics - generative adversarial networks (GANs), deep reinforcement learning (DRL), and transfer learning. Are these types of neural network architectures? Are they something different? How are they used? Well, If you have ever wondered how AI can be creative, wished you understood how robots get their smarts, or were impressed at how some AI practitioners conquer big challenges quickly, then this is your episode!

Matched from the episode's transcript 👇

Chris Benson: To bring this back full-circle on that, if any of our listeners have taken classes from maybe NVIDIA’s Deep Learning Institute, or maybe Coursera, on specific things like NLP or computer vision etc, chances are in that class one of the things you did when you started creating the models for your class was they would have you go in and select an architecture to base that on. That itself is transfer learning. You’re gonna find libraries of these models that are pre-trained, that you can build upon, in all the common frameworks out there. TensorFlow has them, PyTorch has them… It is truly the most common way, certainly to get started or to build upon.

In my own experience, I have more often than not seen people use transfer learning in their work than start from scratch and try to build things completely from the ground up. You would have to do that if there was not the right type of model that you can build upon, but this is normal stuff. This is what we do, and I thought your analogy, Daniel, in terms of using libraries if you’re a programmer, you’re truly using lots and lots of code that other people have built. Maybe a lot of that is open source, maybe some of it is proprietary, but you’re still using those APIs to build whatever thing you’re building, whatever application you’re building… That’s a fantastic analogy you gave, on matching it up to transfer learning in ML.

Changelog Interviews #347

Creating and selling multiplayer online games

We’re talking with Victor Zhou about the explosion of the .io game genre. We talked through all the details around building and running one of these games, the details behind Victor’s super popular game called Generals — which he eventually sold, and we also covered the economics behind creating and selling one of these games.

Matched from the episode's transcript 👇

Victor Zhou: Sure. Definitely, if I had to say the thing that all of these .io games share is that they use websockets. That’s pretty much the only way that you’re gonna be able to get the real-time communication that you need to build one of these web games. I personally used this nice JavaScript socket library called socket.io. I’m not sure if you guys have heard of it, but it’s definitely the top socket library out there right now. We use that in my post, and it makes it really easy to use websockets.

Then on the server side I personally also run just Node.js. My reasoning for that is that I want to be able to share code between the client and the server. So if everything is written in JavaScript, it’s much easier to not have to rewrite stuff. You can imagine I write a class for a player, or something, and I want to be able to use that class on both the client and the server, because the server is the one that’s doing all of the game simulation, but the client also needs it, because 1) it needs to be able to understand and parse information that the server sends to the client, but 2) you also want to be able to do a little bit of simulation on the client side to kind of mask the latency that you’re gonna have.

A big problem with these games is that you can’t use UDP on the web. Everything is TCP, everything is reliable, everything is ordered, but the issue with that is you’re gonna have head-of-line blocking sometimes. So if one game update doesn’t show up to a player, the entire game is gonna freeze for a little bit, as the internet figures out what it’s doing. And then the rest of the game updates are gonna flood in at the same time. There’s just no way around that right now, and there’s a lot that goes into making sure that the client-side experience is as smooth as possible, even though latency is gonna be weird; you’re gonna have weird ping spikes… You might be a player in Brazil, playing on the New York server. I definitely spent a lot of time doing that… So having shared code makes that a lot easier, and helps you get the development done and helps you push the game out faster.

Other than that, you’re gonna probably have a database of some sort if you keep players’ stats, which you might not necessarily do… For example, Generals has this kind of rating system, so you need a database to do that. That’s nothing special; you just have something running. You can store player information in that.

And then also I’ve been talking about this replay feature that I had with my other two games - I believe some other games have it, but I don’t think it’s so common right now… But the way that I’ve been implementing that is just, like I said, storing those replays in an AWS bucket, and then downloading those when I need them.

[44:11] That’s about it… Client - JavaScript. Server - JavaScript. We have websockets for the communication channel between the two, and then we’ve got some database and some other storage solutions behind the scenes to make it all work together.

JS Party #65

Building rapid UI with utility-first CSS

Panelist Jerod Santo and first-time panelist Adam Stacoviak talk with Adam Wathan of Full Stack Radio fame about his CSS utility library called Tailwind CSS that’s growing in popularity to rapidly build custom user interfaces.

Matched from the episode's transcript 👇

Adam Wathan: So the solution with Tailwind there is not to go and create a class at the very beginning; the idea is you wait for duplication to happen, just like when you’re writing real code, when you’re programming - you wait for duplication to actually show up, and then you extract that duplication to avoid the maintenance burden. And there’s sort of two encouraged paths to doing that in Tailwind. The truly CSS-driven Tailwind way to do that is using this feature of Tailwind called @apply, which is like a custom @ rule in Tailwind.

An @ rule in CSS, for anyone who’s not familiar, is something like a media query is an @ rule. It’s got an @ symbol, and then some text after it. @import is an @ rule, @charset is an @ rule… So in PostCSS anyways, which is what Tailwind is sort of powered by under the hood, it will parse your CSS and let you walk all of the @ rules or walk a filtered set of @ rules and manipulate those in abstract syntax trees.

[27:37] What we essentially do is we have this custom @ rule called @apply, and PostCSS doesn’t know that it’s not valid CSS, which is the whole secret sauce really to doing fancy stuff with PostCSS… But essentially we walk your CSS looking for instances of @apply, and what @apply does in Tailwind is it lets you say like – you could create a class like “doc card”, and then inside of it you would just say “@apply”, and after @apply you would just dump a list of class names. So you might say “@apply bg-white p4 rounded-md shadow-md border-gray” whatever. So maybe you’ve got five or six class names there, and what Tailwind does in its processing step basically is it treats all those classes exactly like Sass mix-ins, and it takes the definition of those classes and inlines them into that card class.

So the workflow ends up being you have two cards in your HTML that have the same classes, and you think “Man, I don’t wanna have to maintain these two lists in sync. I wanna create an abstraction.” You basically just select all the classes and the class attribute, cut them, go over to your CSS, come up with a name - which is a lot easier now, by the way, because you have two instances of it and you can sort of think in your head “What do these have in common? What’s a name that actually applies to both of these?” You come up with a class name like “card”, you type @apply, you paste in the list of classes and save the file, and then you replace the class attribute on those two elements with “card” instead of that list of classes now, and now you’ve basically extracted a component class out of a list of utilities.

The nice thing is the whole thing is still built on that underlying design system that you’ve sort of been using for this site anyways, so there’s no weird magic values or anything in there. You could add custom CSS, and sometimes that’s necessary, but generally this workflow is just extracting these classes into a component class to sort of freeze them into this reusable unit, and then applying that in your HTML.

Then the other approach, of course, is if you’re working on something like a React app or a Vue app or something, we already have primitives for reusable pieces of HTML which are components. So instead of creating a card class, you might just make a card React component, or a card Vue component. Then that list of six or seven utility classes is still only defined in one place - it’s defined in that component, so you don’t have a duplication problem anyways, so there’s no actual pressure to even solve that problem.

Practical AI #11

Robot Perception and Mask R-CNN

Chris DeBellis, a lead AI data scientist at Honeywell, helps us understand what Mask R-CNN is and why it’s useful for robot perception. We also explore how this method compares with other convolutional neural network approaches and how you can get started with Mask R-CNN.

Matched from the episode's transcript 👇

Chris DeBellis: [24:15] Yeah, it’s a huge problem. If you think about the simpler example of classifying an object, so “Is this a cat, a dog, a person?” If you were doing training on those images, you could do something simple like create a directory for each type of object. For instance, you have a directory called Dog, and that directory name becomes the object name, the class name, and you put all of your pictures of dogs into that directory, and you train. That’s your labeling. But to do something like detecting the bright location of the bounding box, you would have to take those images and draw the bounding box around the individual objects, and then train.

Extending that further to something like mask, since you want to get accurate masks, you can’t just draw bounding boxes around each of the objects; you have to draw the actual outline. So you end up generating a polygon typically, some really odd shape, enclosed outline for each of the objects. So if you had an image, say, of four cats and four dogs, that’s eight objects you have to outline… And it becomes really tricky when they’re occluded, or one is in front of the other, so it’s only partially showing, and you have that common boundary between the two. You wanna be really accurate when you do that. So yeah, labeling or annotating data for masks is cumbersome and tedious.

Changelog Interviews #308

Biases in AI, helping veterans get jobs in software, open science

Adam and Jerod are on location at OSCON and talk with Camille Eddy about recognizing biases in AI, Jerome Hardaway about the work he’s doing to prepare veterans for jobs in software, and Abby Cobunoc Mayes about the work she’s doing at Mozilla for open science.

Matched from the episode's transcript 👇

Jerome Hardaway: Yeah, same deal. We start the conversation, because I wanna make sure you’re a good fit. We’ve had companies come in and being like “We love what you’re doing; we have colleagues who have hired your people. Would you mind doing Java?” and I’m like, “No… You don’t understand how hard it is if I’m not actually there in front of that veteran to be able to get their machine prepped to do Java and Java Spring Boot well. We have to control the install phase. That’s why we chose JavaScript; the ease of use of being able to get that veteran from not having a dev environment to having a dev environment is super easy in JavaScript, versus more stable languages… It’s like, “Okay, it’s very difficult to do that, so let’s work on this, and then as they get interested, they’ll be able to have this base of knowledge, then they can build on it.”

We had a veteran right now - he last week started his first day of work at J.P. Morgan as Angular and Java Spring Boot developer. We don’t teach Angular, we don’t teach Java Spring Boot, but he was able to get that job because of the deep knowledge base he got with us, and then being able to go and venture out on his spare time outside of class, with Java. I was like, “Alright, that’s awesome! I don’t care what you do, as long as you’re programming. Cool! You’re building. Never stop, dudes.”

That’s another thing that programming has in common with boxing - you stop for a week and you pick it back up, and you will feel it.

Changelog Interviews #224

.NET Core and Microsoft's Shift to Open Source with Bertrand Le Roy

Bertrand Le Roy joined the show to talk about all things .NET Core, their recent 1.0 release, where it’s going, the open source around it, and Microsoft’s shift towards more open source.

Matched from the episode's transcript 👇

Bertrand Le Roy: So there are definitely things that are going to happen in terms of what implementation of the base class library each is using, and there is some convergence going on, so that we actually don’t maintain three different codebases; there is a lot that is being put in common, obviously. But we still have the runtime itself, and we have great implementations of .NET running on iOS and Android, and I’m not sure why exactly we would necessarily convert those on .NET Core. But I don’t know. Maybe. I really don’t know. It might happen at some point. You would have to ask the question to somebody else than me.

[52:16] It’s also a problem of where you put your focus and where you put your energies. We have many things to do, and everything takes time.

Go Time #302

What's new in Go 1.22

Our “what’s new in Go” correspondent, Carlana Johnson, joins Johnny & Ian to discuss what’s new with the latest iteration of Go in version 1.22.

Matched from the episode's transcript 👇

Carlana Johnson: Okay. Yeah, this is a very looping release of Go. So there’s one change that is official, and there’s another change that’s experimental. So the official change is that now you can say [unintelligible 00:06:08.11] and then use an integer.

[00:06:15.24] So if you’re used to those old-fashioned C-style loops, where you say [unintelligible 00:06:18.05] the classic three-expression for loop, you don’t have to do that anymore. Now, you can just say for range integer, and it will automatically range from zero to one less than whatever the integer is.

So it’s not totally perfect… If you wanted to do something where “Oh, I want to skip by two”, or “I want to go backwards”, or whatever - it doesn’t do those things. It just is for going from zero to n, or n minus one. But that’s a pretty common case. So it just cuts down on the boilerplate a little bit. An example of where this is helpful is if you’re writing a benchmark, you’re supposed to say – well, before you would say for [unintelligible 00:07:03.12] and it will automatically loop through as many times as the benchmark wants it to be looped through.

Practical AI #225

Controlled and compliant AI applications

You can’t build robust systems with inconsistent, unstructured text output from LLMs. Moreover, LLM integrations scare corporate lawyers, finance departments, and security professionals due to hallucinations, cost, lack of compliance (e.g., HIPAA), leaked IP/PII, and “injection” vulnerabilities.

In this episode, Chris interviews Daniel about his new company called Prediction Guard, which addresses these issues. They discuss some practical methodologies for getting consistent, structured output from compliant AI systems. These systems, driven by open access models and various kinds of LLM wrappers, can help you delight customers AND navigate the increasing restrictions on “GPT” models.

Matched from the episode's transcript 👇

Daniel Whitenack: [00:26:07.01] There’s a lot of use cases where this may come up, but let’s take one for example. Let’s say that you’re doing data extraction. You have a database with a column in it, which is basically – so this scenario has happened at every company that I’ve been with, so I know that it’s very common… There’s some database with a table in it, and there’s a column that’s like a Comments column, or something… And it’s just like text blobs in there that are like notes from people, or technician messages, or user messages… Or whatever it is, it’s not structured. And you want to run a large language model over that to extract - maybe it’s phone numbers, or prices, or certain classes of information out of this column. Well, you could run your large language model and set up a prompt that says, “Give me the sentiment of each of these pieces of text in my database.” Well, that prompt, each time you run it through a large language model, maybe once it generates an output that says “Space positive sentiment”, and the next time it creates an output that says, “Positive.” And the next time it creates an output that says “This is positive sentiment.” And you can start to see there’s a consistency problem here, like “How do I parse all of these strange outputs from my large language model?” You can do a little bit of prompt engineering to get around that, but ultimately, it doesn’t solve the problem that you could have all sorts of weird output out of your large language model.

So ultimately, what you would want in that scenario is a system that lets you constrain and control what types of output you’re going to get out of your large language model. So in the case of sentiment, maybe I want to restrict my output to only pos, neg, and neu tags for sentiment. There’s only three choices, I always want one of those three. I don’t want it to say “This is positive sentiment.” So I want to actually structure and control the output of my large language model to produce one of these outputs.

Another example that’s maybe a little bit more complicated would be to say, “I actually want to output a valid JSON blob out of my large language model, or valid Python code out of my large language model.” And these are structures that are very well-defined, but you could have all sorts of variability coming out of your large language model. And if you want a specific type coming out of your large language model - maybe it’s a float - that you can do like greater than, or add it to another number, you need that as a typed output. Or you need very specific structured output to actually make automated decisions in your business.

And so with Prediction Guard, what we’re doing is we’re kind of assembling the best of the recent advances in this kind of control and structuring of output, and layering it on top of these open source large language models to allow you to say, “Here’s my prompt. I’m going to send it to these five open and/or closed” - we support Open AI as well… So “open and/or closed models, and for each output, I want you to give me a float number.” And that’s the sort of rich output that you can get from large language models very quickly with Prediction Guard kind of prompt, because you can control the models that you’re using either ones that are more privacy-conserving, or the closed source options, and provide constraints around the output that allow you to actually make business decisions on that. Now, there’s additional checks that could go along with that, like factuality checks and toxicity checks, which we also implement… But I’ve vomited up a lot of information, so I’ll pause here.

Practical AI #220

Causal inference

With all the LLM hype, it’s worth remembering that enterprise stakeholders want answers to “why” questions. Enter causal inference. Paul Hünermund has been doing research and writing on this topic for some time and joins us to introduce the topic. He also shares some relevant trends and some tips for getting started with methods including double machine learning, experimentation, difference-in-difference, and more.

Matched from the episode's transcript 👇

Paul Hünermund: I’ll start with fairness, because that’s actually the very first example that I use in my own course, Causality Causal Inference course here at Copenhagen Business School. It’s a case taken from Google actually, so a while ago, I think in 2019. Well, already earlier - the story goes longer, but they have been accused of underpaying women in their organization. So there we have a classic example of like a protected attribute, like gender, race, and so forth, and we want to prevent bias in some form of automated or semi-automated decision-making, right? And that comes up all the time. I mean, in loan acceptance models, for example, we want to remove bias, and so forth.

[34:23] So to make the story quick, is they have been accused of underpaying women in their organization, and then they did a fairly sophisticated analysis, published a whitepaper, and the result of that analysis was that they found that they’re actually underpaying men; at least they thought so. And not only men, but actually high-level software engineers, so high-seniority software engineers at Google. And then because they’re committed to fairness in their organization, they actually raised salary levels for these high-level software engineers based on the analysis. So it also had a practical component to it, or like a policy implication.

We cannot analyze this case here in detail, but if you do that analysis, it’s very likely that they actually did sort of fairly common causal inference mistakes, or they conditioned on some variables that are downstream, that are affected by gender, like occupation, for example… And then if you have discrimination already at that stage, that for example women don’t have it’s so easy to get into high-level positions for various reasons that we know of, then that will be a classic mistake, and you can produce these kind of, again, nonsensical correlations in the end, like the sharks and the ice cream.

That’s one example that you can actually easily transport to other kinds of questions - like I mentioned, algorithmic bias. And that’s a causal question, because if you don’t understand how variables in your model causally interact and relate to each other, you cannot answer this question, you cannot decide how to correctly analyze the data.

Robustness, I mentioned – so the transportability, transfer learning kind of aspect of experimental knowledge and their causal inference techniques have been developed… Also dealing with selection bias in data, so a dataset that might not be a representative sample of the population that you care about, but it’s measured with some form of selection bias, because only happy customers answer your consumer survey, or unhappy customers, but no one in between answers these questions…

And then lastly, explainability - I think explainability almost comes for free with causal inference. I mean, don’t get me wrong, causal inference is a hard task, but once you solve it, explainability almost comes for free, because - well, I mentioned “The book of why”, right? So causal questions are always related to why questions, counterfactual as well… Like, “Why did my headache go away? It wasn’t because I took the aspirin this morning.” I mentioned this example. This is the way we reason, this is the way we explain, for example, things to other humans, and so there’s an immediate connection to explainability.