Search results for coreos

The inside story on React’s all new docs

Rachel Nabors –beloved educator, animator, & documentation engineer at Meta– joins Amal and Amelia for a first look at the brand new React docs!

This massive overhaul to the React website (which supports 2 million+ developers around the world) was no easy feat! We dive into all the behind the scenes coordination, as well as the goals, wins, and intended outcomes of this new way of approaching educational content and API reference material for open source projects.

Matched from the episode's transcript 👇

Rachel Nabors: Awesome. Yeah, we’re about 75% done with the learning documentation. That’s because the remaining documentation is mostly, well, how to use things around edge cases. Effects are largely used for doing things with React, interacting with things outside React. There’s additionally – you know, we’re going to have to add some things for React’s developer tooling, which is coming later this year… So there’s some stuff that we didn’t have finished, but we had enough done that we didn’t wanna hold back until things were perfect. We wanted to make sure that we were actually getting the content to the community.

The API documentation itself is still very nascent. We wanna really make sure that we are – because hooks are very challenging to document, compared to more traditional APIs. They are deeply nested, they do interesting things with – you know, there’s this thing that returns a function, and that returns a clean-up function, and it takes a dependencies array, but it does different things depending on the state of that array… And a lot of those APIs depend on how we document that last 25% of content, how we explain how to use them. So they’re sort of roadblocked by finishing the rest of the guides themselves. So they’re still en route.

Now, the community of course is very eager to assist in any way possible, and that is awesome, and we appreciate it… But we’re not quite ready to accept community assistance. These flagship pieces of documentation are really things that come right from the core’s heart… And it’s not just something you can churn out. I would know, I’ve tried. It really does get a lot of input from the core team.

Practical AI #156

Photonic computing for AI acceleration

There are a lot of people trying to innovate in the area of specialized AI hardware, but most of them are doing it with traditional transistors. Lightmatter is doing something totally different. They’re building photonic computers that are more power efficient and faster for AI inference. Nick Harris joins us in this episode to bring us up to speed on all the details.

Matched from the episode's transcript 👇

Nicholas Harris: It’s crazy. So like decadal timescales. But then, when you get there, what’s really important is that you can efficiently take lots of cores, lots of processor cores, and connect them to each other. Earlier, I was talking about Amdahl’s law, and the fact that you add another unit of compute, and unfortunately, because of communications, you don’t get another unit of throughput. We’ve invented an interconnect technology that - surprise, surprise - uses light. It’s a wafer-scale computer chip, 8 inch x 8 inch, so it’s the size of your laptop screen practically, and it allows you to let your processors talk optically, and it can dynamically configure how they’re connected. Do you remember those old switch rooms where people are making phone calls, and they need to connect to some other…?

Changelog Interviews #459

Coding in the cloud with Codespaces

On this special edition of The Changelog, we’re talking with Cory Wilkerson, Senior Director of Engineering at GitHub, about GitHub Codespaces. For years now, the possibility of coding in the cloud seemed so close, yet so far away for a number of reasons. According to Cory, the raw ingredients to make coding in the cloud a reality have been there for years. The challenge has really been how the industry thinks, and we are now at a place where the skepticism in cloud based workflows is “non-existent.”

After 15 months in preview, GitHub not only announced the availability of Codespaces for Teams and Enterprise — they also showcased their internal adoption, with 600 of their 1,000 engineers using it daily to develop GitHub.com.

On this episode, Cory shares the full backstory of that journey and a peek into the future where we’re all coding in the cloud.

Matched from the episode's transcript 👇

Cory Wilkerson: It’s totally true. You can now just say “Well, we want the 16 cores, or 32 cores”, or whatever in config, and upgrade everyone’s machine if you want, assuming that you’ve got the approval to do so within your organization.

Ship It! #18

Bare metal meets Kubernetes

In this episode, Gerhard talks to David and Marques from Equinix Metal about the importance of bare metal for steady workloads. Terraform, Kubernetes and Tinkerbell come up, as does Crossplane - this conversation is a partial follow-up to episode 15.

David Flanagan, a.k.a. Rawkode, needs no introduction. Some of you may remember Marques Johansson from The new changelog.com setup for 2019. Marques was behind the Linode Terraforming that we used at the time, and our infrastructure was simpler because of it!

This is not just a great conversation about bare metal and Kubernetes, there is also a Rawkode Live following up: Live Debugging Changelog’s Production Kubernetes 🙌🏻

Matched from the episode's transcript 👇

David Flanagan: Yeah. Go spin up another M3, S3, C3 boxes, just for ten minutes, run [unintelligible 01:04:39.20] See the cores, see the memory…

Practical AI #148

Stellar inference speed via AutoNAS

Yonatan Geifman of Deci makes Daniel and Chris buckle up, and takes them on a tour of the ideas behind his amazing new inference platform. It enables AI developers to build, optimize, and deploy blazing-fast deep learning models on any hardware. Don’t blink or you’ll miss it!

Matched from the episode's transcript 👇

Yonatan Geifman: I think it really depends on the task that you’re trying to do the inference and what are you trying to achieve? I think that if we’re talking about video analytics workloads you must use GPU. You have a lot of data, you want to process the data, the images in high resolution, and you need GPU performance.

If you’re talking about having some queries of an NLP model, you usually find those deployed on CPUs. But it doesn’t matter, because both of them are getting expensive when you get to scale. So if you look at, for example, prices on the cloud for having a 4-core CPU and a T4 GPU - it’s approximately the same. So it’s not like the problem is only the prices of GPU; also the compute of the CPU is getting expensive when you have to run large workloads, with large clusters with multiple nodes and cores.

Go Time #194

Don't forget about memory management

Bryan Boreham (Grafana Labs) and Jordan Lewis (Cockroach Labs) join Mat and Jon to talk about memory management in Go. We learn about the heap, the stack, and the garbage collector. There are also some absolute gems of wisdom scattered throughout this episode, don’t miss it.

Matched from the episode's transcript 👇

Bryan Boreham: I think it might be useful to try and motivate, think for a bit about what is hard about managing this thing we call a heap. So first of all, what is it? So you’ve got some random program, it’s allocated a bunch of small blocks, maybe it’s allocated some bigger blocks, maybe some 64k blocks, maybe one block of 103402 bytes, and the memory manager, the heap manager has to let you do all of these things, and do them in any quantities, in any order, and within some bounds. Your whole computer has got 64 gigs of RAM, 16 gigs, or whatever; there’s some limit that you can’t go above. But the heap manager will let you allocate any number of blocks, of any size, within that overall limit. And it will let you free them up, stop using them. In Go, you don’t explicitly do that; when you no longer have any references to a particular piece of memory, then that’s considered garbage to begin with. It’s still hanging around… We’ll get to that in a second.

But assuming we’ve managed to free up some memory, now the manager has – the next time you wanna allocate some memory, it’s got the task of figuring out where there’s a hole. You don’t wanna just get bigger and bigger and bigger the whole time. You’ve freed 64k, and now you wanna allocate 64k - you should probably use the one you’ve just freed. So the memory manager has a task of trying to give you back the best block to keep things under control. Maybe not the best, but some kind of reasonable choice of block. It’s got a lot of options. If you freed up 64k and then you allocated 8 bytes - well, it could take the first 8 bytes of that 64k.
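The block-reuse idea described here can be sketched with a toy free list. This is only an illustration and not how Go's runtime allocator works: the `toyHeap` and `span` types, the 1 MiB arena size, and the "most recently freed block first" policy are all assumptions made for the example.

```go
package main

import "fmt"

// span is a hypothetical record for one free region: [off, off+size).
type span struct{ off, size int }

// toyHeap is an illustrative first-fit free list over a fixed arena.
// It is NOT a model of the real Go allocator.
type toyHeap struct{ free []span }

func newToyHeap(size int) *toyHeap {
	return &toyHeap{free: []span{{0, size}}}
}

// alloc returns the offset of the first free span that can hold n bytes,
// or -1 if nothing fits. A larger span is split and the remainder stays free.
func (h *toyHeap) alloc(n int) int {
	for i, s := range h.free {
		if s.size < n {
			continue
		}
		if s.size == n {
			h.free = append(h.free[:i], h.free[i+1:]...)
		} else {
			h.free[i] = span{s.off + n, s.size - n}
		}
		return s.off
	}
	return -1
}

// release puts a block at the front of the free list so that it is the
// first candidate for the next allocation (no coalescing of neighbours).
func (h *toyHeap) release(off, n int) {
	h.free = append([]span{{off, n}}, h.free...)
}

func main() {
	h := newToyHeap(1 << 20)
	a := h.alloc(64 << 10) // first 64k block
	b := h.alloc(64 << 10) // second 64k block
	h.release(a, 64<<10)   // free the first one
	c := h.alloc(8)        // 8 bytes land at the start of the freed 64k
	fmt.Println(a, b, c)   // prints "0 65536 0"
}
```

Even this toy version has to decide where freed blocks go and which hole to pick first, which hints at why a real heap manager with millions of blocks is complicated.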

I’m trying to motivate this picture that it’s actually really complicated to keep track of all these potentially millions and millions of blocks at all different sizes. Then we throw in some performance considerations that most computers these days have multiple CPU cores, and you really wanna keep the memory together on one CPU core and not have little bits of memory next to each other being used by different CPU cores, so the memory manager is gonna try and help that along; it’s gonna actually keep different, typically arenas of memory for different cores. And we haven’t even gotten to our garbage collection yet. It’s already really complicated. And any memory manager, in C, in C++, in Objective-C, whether you’re using automatic reference counting - they’re all doing the stuff I’ve talked about so far. They’re all kind of keeping track of what’s in use, what’s not in use, what could be reused… They’re all doing that.

Go Time #190

How to make mistakes in Go

The panel are joined by Teiva Harsanyi, author of 100 Go Mistakes, to talk about how best to make mistakes when writing Go.

Matched from the episode's transcript 👇

Teiva Harsanyi: [laughs] Now, I believe actually that it’s even more of a belief in Go thanks to goroutines, as you have said, Mat… Because goroutines compared to threads are great, they are more lightweight, they are faster to spin up, they are faster to context-switch, and so on… So there shouldn’t be any real reason for a concurrent application to be slower than a sequential application.

I took here a concrete example, actually… I took as an example the merge sort algorithm. And just as a quick reminder about what the merge sort algorithm is - if we take for example the recursive implementation, we basically get a list of elements as an input, and we will break down repeatedly each sub-list into sub-lists, into two halves. We are going to do it repeatedly. And once we reach sub-lists of a single element, we go up again and we merge the two sub-lists in a sorting manner.

A quick example, if we have 2 and 1, for example, we are going to split it into two halves, two on one side, one on the other side, and as each sub-list contains a single element, meaning it’s already sorted, then we are going to merge it in a sorting manner, so we will have one and two.

So in a nutshell, the merge sort algorithm - we just get a slice as an input, we check the length of the slice, if it’s bigger than one, we compute the middle, we apply merge sort on the first half, merge sort on the second half, and then we merge.
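The steps just listed can be sketched as a small sequential Go program. This is a minimal illustration, not the code from the book or the episode; the function names `mergeSort` and `merge` are chosen here for the example.

```go
package main

import "fmt"

// merge combines two already-sorted slices into one sorted slice.
func merge(left, right []int) []int {
	out := make([]int, 0, len(left)+len(right))
	i, j := 0, 0
	for i < len(left) && j < len(right) {
		if left[i] <= right[j] {
			out = append(out, left[i])
			i++
		} else {
			out = append(out, right[j])
			j++
		}
	}
	out = append(out, left[i:]...)
	return append(out, right[j:]...)
}

// mergeSort returns a sorted copy of s: check the length, compute the
// middle, sort each half, then merge the two sorted halves.
func mergeSort(s []int) []int {
	if len(s) <= 1 {
		return append([]int(nil), s...)
	}
	mid := len(s) / 2
	return merge(mergeSort(s[:mid]), mergeSort(s[mid:]))
}

func main() {
	// The two-element example from the episode.
	fmt.Println(mergeSort([]int{2, 1})) // prints "[1 2]"
}
```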

So the structure, for example, for this algorithm seems like a perfect fit for concurrency, because we could say every time I can handle each half into a specific goroutine. So the first half in one goroutine, the second in another goroutine, and say I will introduce some sort of synchronization at some point to wait for both goroutines.
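A naive version of that idea might look like the sketch below, where every split hands each half to its own goroutine and a `sync.WaitGroup` provides the synchronization. The name `parallelMergeSort` is an assumption for illustration, and this is not necessarily the code that was benchmarked in the episode.

```go
package main

import (
	"fmt"
	"sync"
)

// merge combines two already-sorted slices into one sorted slice.
func merge(left, right []int) []int {
	out := make([]int, 0, len(left)+len(right))
	i, j := 0, 0
	for i < len(left) && j < len(right) {
		if left[i] <= right[j] {
			out = append(out, left[i])
			i++
		} else {
			out = append(out, right[j])
			j++
		}
	}
	out = append(out, left[i:]...)
	return append(out, right[j:]...)
}

// parallelMergeSort sorts each half in its own goroutine at every level
// of the recursion, then waits for both before merging.
func parallelMergeSort(s []int) []int {
	if len(s) <= 1 {
		return append([]int(nil), s...)
	}
	mid := len(s) / 2

	var left, right []int
	var wg sync.WaitGroup
	wg.Add(2)
	go func() {
		defer wg.Done()
		left = parallelMergeSort(s[:mid])
	}()
	go func() {
		defer wg.Done()
		right = parallelMergeSort(s[mid:])
	}()
	wg.Wait() // synchronization point: both halves must be sorted

	return merge(left, right)
}

func main() {
	fmt.Println(parallelMergeSort([]int{8, 3, 5, 1, 7, 2, 6, 4}))
}
```

Note that this version spawns goroutines all the way down to single-element slices, so the amount of scheduling and synchronization work grows with the input.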

So if we implement this parallel version of the algorithm, I run it on my local computer with a certain number of elements, and actually this parallel version is about ten times slower than the sequential version. And despite the fact that the parallel version leverages multiple cores, right? So it’s more than ten times slower. And what is the reason for that, if we think a bit about it? As we said, the algorithm is about to repeatedly split lists into two sub-lists; so at some point we will have 1,024 elements, then 512, then 256 and so on, until we reach 8, 4, 2 and 1 elements.

Now let’s try to imagine, in your opinion, what’s the fastest between spinning up two goroutines that will both merge two elements and wait for them, or in the current goroutine merge two elements and then merge two other elements? And of course, it’s gonna be the latter here, right? Because it’s gonna be faster to do it in the current goroutine.

[20:10] And if we think about it actually, in the merge sort algorithm, the deeper we go, the less efficient it will be to spin up a goroutine. And sure, goroutines are fast, but spinning up a new goroutine - it has a cost, because we have to wait for its creation, we have to wait for the internal Go scheduler to execute it, we have also the fact that concurrency introduces some form of synchronization because of mutex, or channels, or whatever… So everything has a cost, right?

Here one possible solution for this algorithm - the goal is not, of course, to design the most optimal solution for the merge sort algorithm, but discuss about a potential solution. It could be to say I will define a threshold, and I will apply the parallel algorithm that we’ve just described, but if the number of elements is below a certain threshold, it’s simply not worth spinning up new goroutines. So instead, I am going to execute sequentially. And this threshold may depend on the machine, and everything. On my side it was about 2048 elements. And if I run this new hybrid version, let’s say, of the parallel algorithm, it’s about 40% faster this time compared to the sequential implementation.
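The threshold approach can be sketched as follows. The constant of 2048 mirrors the figure quoted above for one machine and is an assumption that would need tuning elsewhere; `hybridMergeSort` and `sequentialMergeSort` are names made up for this example, and no timing claims are implied by the sketch itself.

```go
package main

import (
	"fmt"
	"sync"
)

// threshold is an assumed cutoff below which goroutines are not worth it.
// The episode quotes roughly 2048 elements on one particular machine.
const threshold = 2048

// merge combines two already-sorted slices into one sorted slice.
func merge(left, right []int) []int {
	out := make([]int, 0, len(left)+len(right))
	i, j := 0, 0
	for i < len(left) && j < len(right) {
		if left[i] <= right[j] {
			out = append(out, left[i])
			i++
		} else {
			out = append(out, right[j])
			j++
		}
	}
	out = append(out, left[i:]...)
	return append(out, right[j:]...)
}

// sequentialMergeSort sorts entirely in the calling goroutine.
func sequentialMergeSort(s []int) []int {
	if len(s) <= 1 {
		return append([]int(nil), s...)
	}
	mid := len(s) / 2
	return merge(sequentialMergeSort(s[:mid]), sequentialMergeSort(s[mid:]))
}

// hybridMergeSort uses goroutines only while the slice is larger than
// threshold, and falls back to the sequential version below it.
func hybridMergeSort(s []int) []int {
	if len(s) <= threshold {
		return sequentialMergeSort(s)
	}
	mid := len(s) / 2

	var left, right []int
	var wg sync.WaitGroup
	wg.Add(2)
	go func() {
		defer wg.Done()
		left = hybridMergeSort(s[:mid])
	}()
	go func() {
		defer wg.Done()
		right = hybridMergeSort(s[mid:])
	}()
	wg.Wait()

	return merge(left, right)
}

func main() {
	data := make([]int, 10000)
	for i := range data {
		data[i] = len(data) - i // 10000, 9999, ..., 1
	}
	sorted := hybridMergeSort(data)
	fmt.Println(sorted[0], sorted[len(sorted)-1]) // prints "1 10000"
}
```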

And one very last thing to say - I have done the same test in Java actually, where we don’t have the principle of goroutines (we just have threads here) and the threshold actually was higher. It was around four times bigger, if I recall correctly, compared to goroutines. So it’s kind of interesting, because somehow it shows that goroutines are actually somehow more efficient than threads for concurrent workloads, because they are for example faster to spin up… But as we illustrated with the merge sort algorithm, it’s not magic nonetheless; concurrency isn’t always faster.

Break: [22:19]

JS Party #178

Running Node natively in the browser

Eric Simons and the StackBlitz team recently announced WebContainers which let you run Node.js natively in your browser! This has BIG implications and leaves us with many BIG questions like: how did they do it, why did they do it, and where does it go from here? Tune in! Keyword: BIG

Matched from the episode's transcript 👇

Eric Simons: Bingo. But Fortune 100 companies - everyone has approved Google Chrome as a browser. These Fortune 100 companies - they have allowed a runtime explicitly for everyone at the company to use that they trust security-wise… For good reason. Google’s got 15,000 cores, right now, while we’re sitting, fuzzing that codebase, 24 hours a day, seven days a week… And they’ve got [unintelligible 00:28:40.04] This company is leading the web security industry. They take this stuff seriously. So if you’re gonna say “Hey, we’re gonna introduce a WebAssembly runtime that’s more local-based…”, which I have no doubt one will pop up… But if I were to bet money, I think it’s gonna probably be based on all of the work that Google does from a security standpoint, for the exact reason that Cloudflare uses V8 for Cloudflare workers.

Practical AI #129

Going full bore with Graphcore!

Dave Lacey takes Daniel and Chris on a journey that connects the user interfaces that we already know - TensorFlow and PyTorch - with the layers that connect to the underlying hardware. Along the way, we learn about Poplar Graph Framework Software. If you are the type of practitioner who values ‘under the hood’ knowledge, then this is the episode for you.

Matched from the episode's transcript 👇

Dave Lacey: So Poplar is our graph programming framework. So that is a way of representing graphs that run natively on our device, that do these kinds of operations… And in Poplar we have graphs that kind of break it down to the individual processes. So on each of our chips we have about 1,400 cores/processors, each of which has hardware threading in there, so you’ve got about 7,000 parallel compute units. And Poplar graph kind of represents the graph at that kind of level. And PopLibs is what kind of then says “Well, I’ve got this matrix multiplying to do. How do I split that over the parallel units in an efficient way?” So that’s where it does partitioning, and axis splitting, and stuff like that.

Then we have the Poplar graph compiler, which then will take that fine low-level graph and create actual code for the device, which then goes into the graph engine, which then runs it.

There are quite a few levels… You notice quite a few compilers involved. We counted them, and there’s like 5-6 different compilers that have to interact to get that efficient implementation down on that device.

There’s some other things that go on, like sometimes you might want [unintelligible 00:29:30.19] so at a higher level you’ll do model pipelining, and things like that, to get efficient models for [unintelligible 00:29:39.02] and things like that… But fundamentally, that’s the flow.

JS Party #170

Headlines? More like HeadLIES!

Jerod and Nick discuss the big Deno news, play a ridiculous new game in honor of April Fool’s Day, then give shout outs to some awesome software projects we love.

Matched from the episode's transcript 👇

Jerod Santo: Like you’re five? [laughs] It’s a limited liability corp– no, what are they doing… So I don’t know exactly what they’re going to do; I will tell you that they’re not going to do an open core business model, which would be where they provide certain features of Deno and some sort of like an open source core, and then build on top of that, around it, more advanced, or pro, or premium features of Deno, and make that what you pay for. They’re not doing that. In fact, their software is MIT-licensed, and will retain the MIT license. In fact, Ryan says in their post “For Deno to grow and be maximally useful, it must remain permissively free. We don’t believe the open core business model is right for a programming platform like Deno. We do not wanna find ourselves in the unfortunate position where we have to decide if certain features are for paid customers only.” That’s really the rub with these open cores - deciding what goes where… And there’s a conflict of interest at different times, and it can be difficult to navigate that successfully.

They say “If you watch our conference talks, you will find we’ve been hinting at commercial applications of this infrastructure for years. We are bullish about the technology stack we’ve built, and intend to pursue those commercial applications ourselves. Our business will build on the open source project, not attempt to monetize it directly.”

So that’s what they’re saying… Now, TBD what exactly all that means. There is on the new Deno.com - so they probably shelled out some of that five million on getting Deno.com, because it’s always been deno.land, and now they have deno.com, because it’s official. They have a new Deploy section, which seems a hint at their first potentially commercial offering… I just don’t know exactly what that is. Did you check out that deploy thing?

Changelog Interviews #433

Open source, not open contribution

This week we’re talking with Ben Johnson. Ben is known for his work on BoltDB, his work in open source, and as a freelance Go developer. Late January when Ben open sourced his newest project Litestream in the readme he shared how the project was open source, but not open for contribution. His reason was to protect his mental health and the long term viability of the project. On this episode we talk with Ben about what that means, his thoughts on mental health and burnout in open source, choosing a license, and the details behind Litestream - a standalone streaming replication tool for SQLite.

Matched from the episode's transcript 👇

Ben Johnson: Yeah, so I think scaling is an interesting topic in our field. I feel like there’s been an obsession over scaling, and uptime. I think they have kind of gone off the rails over the last 10-20 years, where we have this idea of like – everyone tries to build their application to be the next Twitter, or whatnot. People worry about “What if I have to scale like crazy, in whatever amount of time?” And generally that’s not the case, first of all. But given Moore’s Law, where we are seeing exponential increases in compute that we have available on single blocks, but for some weird reason we keep having this exponential scaling of the number of nodes we actually need to run applications… It seems backwards to me.

We have nodes on Amazon where you can spin up a 96-core box for however much money a month… But that’s a lot of cores. Each one’s doing three billion operations per second. We should be able to run a couple hundred HTTP requests to that. So as far as the scaling piece, I find that most people, if you’re running a local SQLite database, you’re not gonna hit those scaling concerns.

Actually, one scaling concern I find people actually hit is things like Postgres tend to have a high overhead for connections, so you end up having to put in something like a PgBouncer in between, that can actually start to pool those connections to not overload Postgres… Whereas you just don’t get that when you have an in-process database.

[01:08:02.24] So from that standpoint it’s great… I would say that if you’re running applications – again, I write in Go; it’s a super-fast language, and running locally, I can push through thousands and thousands of requests per second on pretty modest hardware… And I think that that really covers probably 90% of applications out there that people are gonna write. And even if you don’t use SQLite for your main company’s application, there’s probably a ton of applications in your company that are on the side, that are periphery, that don’t need to be some huge Kubernetes cluster.

So I would say that on the scaling side… And then on the uptime side, I feel like people have this obsession around uptime…. But I feel like the more tools that people add – and I don’t really mean to rag on Kubernetes all the time; I do, but I think of it as a tool that has an appropriate use case, but it’s not the vast majority of people’s use cases.

I think that from an uptime perspective you’re getting many more layers of complexity in there that are gonna cause you to have more downtime than simply running a single node that may go down because of a network connection once a year, or a couple times a year, for a couple minutes. I don’t think people really take in the cost of downtime when they think about the trade-off they’re making to make these complex systems that give them the illusion of uptime. I hope that makes sense.

Practical AI #115

From research to product at Azure AI

Bharat Sandhu, Director of Azure AI and Mixed Reality at Microsoft, joins Chris and Daniel to talk about how Microsoft is making AI accessible and productive for users, and how AI solutions can address real world challenges that customers face. He also shares Microsoft’s research-to-product process, along with the advances they have made in computer vision, image captioning, and how researchers were able to make AI that can describe images as well as people do.

Matched from the episode's transcript 👇

Bharat Sandhu: Yeah. You know, we’re still working with them on a lot of these things right now. Right now, what we wanna make sure we can enable OpenAI to do - really breakthrough AI research, and to give them amazing cloud resources, so things we spoke about. I think we have now the fifth largest super-computer. It was probably the first one in the cloud, that they access to build these models. These are about 300,000 CPU cores, 10,000 GPUs, and the networking layer that goes with it… And then also it allows us to develop our optimizations.

You might have heard of something like ONNX Runtime, which is useful for high-speed inferencing, but we’ve also tuned it to do high-speed training. But all those optimizations kind of came in also with the work that we’ve been doing with OpenAI, and even internally, and all that stuff.

So yeah, there’s more to come on where some of this work shows up, but we also – our goal is to allow OpenAI to do amazing work like GPT-3, and give them the ability to do it from tooling, and all that. And all that also shows up for our customers, even if they don’t get access to a GPT-3 model today, which they can get from OpenAI. But all the work that went into enabling OpenAI to do the work is available to our customers also.

Changelog Interviews #421

The future of Mac

We have a BIG show for you today. We’re talking about the future of the Mac. Coming off of Apple’s “One more thing.” event to launch the Apple M1 chip and M1 powered Macs, we have a two part show giving you the perspective of Apple as well as a Mac app developer on the future of the Mac.

Part 1 features Tim Triemstra from Apple. Tim is the Product Marketing Manager for Developer Technologies. He’s been at Apple for 15 years and the team he manages is responsible for developer tools and technologies including Xcode, Swift Playgrounds, the Swift language, and UNIX tools.

Part 2 features Ken Case from The Omni Group. Ken is the Founder and CEO of The Omni Group and they’re well known for their Omni Productivity Suite including OmniFocus, OmniPlan, OmniGraffle, and OmniOutliner – all of which are developed for iOS & Mac.

Matched from the episode's transcript 👇

Ken Case: And when we looked at our development cycle, a lot of the products that we build, like OmniGraffle, a major version of that might take about two years to build. So when we were starting that process, we were expecting that the hardware that we would end up shipping on would be twice as fast as where we were starting. And obviously, we have to be careful; we wanna run on existing shipping hardware as well, but there are things that you can plan to do that you know are possible on faster hardware - better animations, and so on. Or adopting some of the stuff that you’ve mentioned earlier about our look and feel matching the platform really well. That might not make sense if you thought you were stuck on the same hardware that you added that day. [laughter]

And then of course in the 2000’s and 2010’s we had petered out. We ran into limitations in the hardware, and we started trying to solve that problem by scaling out the processors to more and more cores. We didn’t have single cores getting faster, at least not nearly at the same pace, but we tried to [unintelligible 01:11:03.15] that could work well with multiple cores.

Well, looking ahead, I’m looking forward to the return of having cores getting faster and faster, individual cores. I’m looking forward to knowing that we can build some things that maybe today’s hardware isn’t capable of, but next year’s will be.

Practical AI #112

Building a deep learning workstation

What’s it like to try and build your own deep learning workstation? Is it worth it in terms of money, effort, and maintenance? Then once built, what’s the best way to utilize it? Chris and Daniel dig into questions today as they talk about Daniel’s recent workstation build. He built a workstation for his NLP and Speech work with two GPUs, and it has been serving him well (minus a few things he would change if he did it again).

Matched from the episode's transcript 👇

Daniel Whitenack: And looking back, I would tell myself to just go ahead and take that price hike… Because if that was the case, I wouldn’t constantly be moving models around between these two systems to do my testing… The way I understand it, the Threadripper, the AMD chip has more cores… So if you’re doing a lot of multi-threaded stuff, it’s really good. But the single-core speed on the Intel processors is higher. So depending on what your workload is, then you could actually get a performance boost, even though you have fewer cores, but with a higher core speed with the Intels. All of that stuff - it seems to work a little bit better for me… And this is just my own personal experience.

Brain Science #30

I'm just so stressed

Stress is something that we will inevitably encounter throughout our lives. It isn’t all bad or maladaptive, but how we manage it can make a significant difference in our lives. The degree of stress we feel impacts how we show up in the world including both how we relate and how we do the work before us each day.

In this episode, Mireille and Adam discuss the impact of stress on our systems including the role of different stress hormones on our immune system, cardiovascular system and our metabolism. Like many other conversations on previous episodes, we provide research relative to the value of relationships as having close connections helps us all combat the stress that loneliness can cause as well. When we utilize resources to support us as well as set limits on what we expose ourselves to and focus our attention to, we have the opportunity to better navigate the stresses of our lives.

Matched from the episode's transcript 👇

Mireille Reece, PsyD: Yeah. So we’re going to be talking about some different, more sciency terms to help you guys understand a little bit more of what happens. And so one of the things is glucocorticoids. These are named that for their ability to promote the conversion of proteins and fats to usable carbohydrates. So they’re super adaptive or help us by replenishing the energy reserves after a period of activity, like running from a predator. But they also act on the brain to increase appetite for food and increase locomotor activity so that it can regulate more of our energy, input and expenditure.

So like you were mentioning, Adam, with the choices we make, it’s super helpful if I’m trying to run a few miles, but not as helpful if I’m trying to grab a box of Oreos, or while I’m trying to write or do some work. That inactivity or lack of energy expenditure creates this situation where these chronically elevated glucocorticoids can impede the action of insulin to promote glucose uptake. So it’s like I’ve got too much of this thing and I’m not expending it or putting it in a way that actually helps my brain and body defrag.

JS Party #142

Horse JS speaks!

We kick off with some exciting TypeScript news, follow that with some exciting JavaScript news, then finish off with an exciting interview. Key word: EXCITING

Matched from the episode's transcript 👇

Jerod Santo: So things that Elder provides for you - build hooks and a highly-optimized build process, and it’ll span all your CPU cores. So while Slack is using all your memory, Elder will use all your processors, and build it as fast as possible. It’s built for large sites, and the SEO of sites of ten to a hundred thousand plus pages. It uses Svelte everywhere, including your SSR templates… And as well as partial hydration. So check it out. If Svelte is something you’re interested in and you enjoy static sites/JAMstack things, maybe Elder will get you started.

Go Time #145

Füźžįñg

A deep dive on Fuzzing and a close look at the official Fuzzing proposal for Go.

Matched from the episode's transcript 👇

Filippo Valsorda: [23:46] I think there’s also an angle of maturity of the ecosystem in there, of maturity of the technique… Because when fuzzing is just this tool that some security researchers use to smash against a program once, try to get something out of it and then move on - of course, they just run the corpus wherever they’re keeping it. But I feel like just like with testing we set up continuous integration and we trust machines to do the heavy lifting for us, I expect that fuzzing also takes that path once it’s built into developer workflows.

So you would have a small corpus locally on your machine, and Katie’s proposal puts it automatically in a cache folder… That will do a very quick pass, but you’re not gonna run the fuzzer mostly on your laptop. Part of what makes fuzzers work is that computers are fast, but also you can keep throwing more cores at it. And then you upload it, and some CI or OSS-Fuzz or some continuous integration system can just run the fuzzer, and it should persist the corpus, so it will keep running the same corpus against it, so that you make changes and the corpus is already hot and large, but is not checked into your repository, because most people don’t want megabytes and megabytes of corpus checked in.

Brain Science #25

The science behind caffeine

Today’s episode features our very first guest. We’re joined by Danielle Rath, a notable expert and product developer in the caffeine and energy drink industry. Danielle is the founder of GreenEyedGuide Research and Consulting where she shares science-based information about energy drinks and caffeine, and helps people and companies where fatigue and caffeine use are prevalent. In this lengthy episode, we talk through all aspects of the science behind caffeine: its chemical structure and half-life, where and how it’s being used, the good, bad, and the ugly, as well as practical advice for everyday consumption. If you consume caffeine of any sort, this is a must-listen episode.

Matched from the episode's transcript 👇

Danielle Rath: Well, Oreos are healthy because they’re vegan? Well, you know… [laughter]

Changelog Interviews #403

Laws for hackers to live by

Dave Kerr joins Jerod to discuss the various laws, theories, principles, and patterns that we developers find useful in our work and life. We unpack Hanlon’s Razor, Gall’s Law, Murphy’s Law, Kernighan’s Law, and too many others to list here.

Matched from the episode's transcript 👇

Jerod Santo: Oh, gosh… Let me throw my friend Nick Nisi under the bus, who is a good friend and a good engineer and a JS Party panelist. We’re working on some software around JS Party’s game show; we have a Jeopardy-style game show called JS Danger, and we’ve built a web app so you can actually have a gameboard… And in that web app you have the contestants, and they have their faces. So it’s “These are the three people who are the contestants”, and we’ve put their avatars in there… And I built the first version of things; I was building out the JSON structure of how we’re gonna load this data as we can reuse this gameboard… And I’d just go out and I’d figure “Well, we’ll just load a URL and make an image source.” So I just go out to their Twitter profiles, and I right-click, and – I can’t remember if I download the file, or I just grab the URL and throw that into the JSON blob.

Then I pass it off to Nick to continue working on this, and he decides that instead of just a string, which holds a URL to an image, he’s gonna have like a handler function, which does something else… And that way, we can just put their Twitter name in, whoever it is, and it will go determine whatever their actual current photo is, and all this kind of stuff. And then – I mean, totally YAGNI, by the way. We’re gonna use this gameboard like once a month, once every few months… And we know the contestants beforehand, and it takes about 30 seconds to go grab those URLs. But a dynamic lookup was nice, even though YAGNI, until something changed in Twitter’s API, and the core’s rules, or something… Anyways, he couldn’t deterministically figure out what the URLs were anymore, so then he had to write a proxy server in order to resolve the actual URLs of the avatar images, and get a token, and all this kind of stuff. So sorry, Nick, but I threw you under the bus there. We’ve all done it. You were just over-engineering a thing that was totally YAGNI… And he had fun doing it.

Changelog Interviews #402

What's next for José Valim and Elixir?

We’re joined again by José Valim talking about the recent acquihire of Plataformatec and what that means for the Elixir language, as well as José. We also talk about Dashbit, a new three-person company he helped form from work done while at Plataformatec, to help startups and enterprises adopt and run Elixir in production. Lastly, we talk about a new idea José has called Bytepack that aims to help developers package and deliver software products to developers and enterprises.

Matched from the episode's transcript 👇

José Valim: [27:42] Yeah, so Broadway is a library for doing data ingestions in data pipelines in Elixir. If you want to consume data from SQS, or RabbitMQ, or Google Cloud Pub/Sub or Kafka in a very efficient way, utilizing all the cores in your machine and doing batching, automatic acknowledgments, all those kinds of things that are expected from a robust data processing pipeline or data ingestion, Broadway is exactly for that.

And you know what’s the coolest thing about Broadway? It’s how and why we created it. That goes directly to what Adam was saying about the heart of the company… So when we started the subscription - this was still back at Plataformatec - we were working with different clients, and we were using a library called GenStage. So Broadway is built on top of GenStage, and it’s called Broadway exactly because it coordinates these stages for you; that’s kind of the pun in the name.

So we have a lot of companies that were using GenStage, our clients, and they were building those complex pipelines, and we were seeing our clients make the same mistakes over and over again… So we would work with them, improve, and then it came upon us, like “Wait, if everybody is building this and everybody is making the same mistakes, there’s probably something that we can do about it, or maybe there’s a higher-level abstraction.”

So we started working on Broadway, that’s how Broadway came to be… And it was really nice to later work with those clients, where they were getting their old code and they would ping us in the PRs, like “We have removed 600 lines of code, and we are adding 50 lines of code thanks to Broadway.” So it was really just like, you know, work with the client, see exactly what’s wrong or where it’s lacking, getting that energy, putting it back into open source, and then going for this whole cycle of getting this feedback… But yeah, that’s Broadway, in a nutshell.

Backstage #10

YouTube made me do it

Long-time listener (and YouTube aficionado) Owen Bickford joins Jerod backstage to discuss his recent contribution to Changelog’s Elixir/Phoenix-based open source platform.

Matched from the episode's transcript 👇

Owen Bickford: So kind of the excellent thing about Elixir is it builds on Erlang, and the compiler can take advantage of whatever cores you have on your machine, and whatever threads, if it’s multi-threaded. So while Elixir doesn’t do multi-threading internally per se, the compiler can use all the threads that are available on your computer. So with a dual core you’re really constrained. So you’re waiting longer for each of these files to compile.

[28:13] I just built a Linux machine for the first time. It’s an 8-core Ryzen. It wasn’t really suffering quite as much with those 220 files. It would take a couple of seconds… But I could see in the video it would take several seconds –

Go Time #109

Concurrency, parallelism, and async design

Go was designed with concurrency in mind. That’s why we have language primitives like goroutines, channels, wait groups, and mutexes. They’re very powerful when used correctly, but they can be very complicated if used unwisely.

Roberto Clapis joins the team once again to drop async wisdom in your ears. Don’t worry, we do it in serial. 😉

Matched from the episode's transcript 👇

Roberto Clapis: I think it does. Actually, before using Go, I had been using Python for years, and at one point I needed to solve a problem using all the 36 cores that I had available on a cluster at my university… And the pain of doing that with a language that wasn’t designed with that in mind actually brought me to say “Okay, what about learning a new language that maybe makes this easier?” And with Go, it was I think 100 lines and I was done.
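To make the point about using every core concrete, here is a minimal sketch in Go. It is not Roberto's program, which is not shown in the episode; `SumSquares` and the chunking scheme are hypothetical, chosen only to illustrate fanning CPU-bound work out to one goroutine per core and waiting on a `sync.WaitGroup`.

```go
package main

import (
	"fmt"
	"runtime"
	"sync"
)

// SumSquares is a hypothetical example function: it splits nums into
// roughly equal chunks, sums the squares of each chunk in its own
// goroutine, and then adds up the partial results.
func SumSquares(nums []int, workers int) int {
	if workers < 1 {
		workers = 1
	}
	partial := make([]int, workers) // one slot per worker, so no lock is needed
	chunk := (len(nums) + workers - 1) / workers

	var wg sync.WaitGroup
	for w := 0; w < workers; w++ {
		start := w * chunk
		if start >= len(nums) {
			break // fewer chunks than workers
		}
		end := start + chunk
		if end > len(nums) {
			end = len(nums)
		}
		wg.Add(1)
		go func(w, start, end int) {
			defer wg.Done()
			for _, n := range nums[start:end] {
				partial[w] += n * n
			}
		}(w, start, end)
	}
	wg.Wait()

	total := 0
	for _, p := range partial {
		total += p
	}
	return total
}

func main() {
	nums := make([]int, 1000)
	for i := range nums {
		nums[i] = i + 1
	}
	// runtime.NumCPU reports the logical CPUs usable by this process.
	fmt.Println(SumSquares(nums, runtime.NumCPU()))
}
```

Each goroutine writes only to its own slot of `partial`, so the workers never contend on shared state; the Go scheduler spreads them across the available cores.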

Go Time #103

All about caching

Manish Jain and Karl McGuire of Dgraph join Johnny and Jon to discuss caching in Go. What are caches, hit rates, admission policies, and why do they matter? How can you get started using a cache in your applications?

Matched from the episode's transcript 👇

Manish R Jain: I think before we begin the discussion I should probably explain the scale. In this case, by scale at the internal system memory level we’re talking about scaling in terms of the number of cores, the number of goroutines, the number of concurrent lookups that could be happening… As opposed to when we talk about database scale, we talk about different machines and how many terabytes of data you can keep. So scale in this case is the number of concurrent accesses that could happen…

[15:59] So we tried in Dgraph a bunch of different techniques. The simplest thing that anybody could do is take a map in Go, put a mutex lock around it, and then for every get you just acquire the lock and you do the retrieval. Now, that would work, and that works very nicely for some basic use cases with low concurrency, but it becomes a hard challenge on what to evict and when. If you do it badly, you will directly affect your hit ratios, which means that things would actually slow down… Because note that a cache can also slow things down, right? Cache is an extra step that you have to do. Not only do you have to retrieve the data from the underlying hard disk or system, you also have to first check in the cache if the data exists, and then later on put it into the cache. That lock acquisition and release can become a source of contention, as we’ve found in Dgraph.
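The "map with a mutex around it" approach described here might look roughly like the sketch below. This is an illustration under assumptions, not Dgraph's code: `LockedCache` and its methods are hypothetical names, and there is deliberately no eviction policy, which is the hard part the speaker mentions.

```go
package main

import (
	"fmt"
	"sync"
)

// LockedCache is a hypothetical name for the simplest possible cache:
// a plain Go map guarded by a single mutex. It never evicts anything.
type LockedCache struct {
	mu   sync.Mutex
	data map[string]string
}

// NewLockedCache returns an empty cache ready for use.
func NewLockedCache() *LockedCache {
	return &LockedCache{data: make(map[string]string)}
}

// Get takes the lock on every lookup, so concurrent readers queue up
// here. This is the contention point under many goroutines.
func (c *LockedCache) Get(key string) (string, bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	v, ok := c.data[key]
	return v, ok
}

// Set stores a value under the same lock.
func (c *LockedCache) Set(key, value string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.data[key] = value
}

func main() {
	cache := NewLockedCache()
	cache.Set("user:1", "alice")

	// Eight goroutines read concurrently; each one serializes on mu.
	var wg sync.WaitGroup
	for i := 0; i < 8; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			cache.Get("user:1")
		}()
	}
	wg.Wait()

	v, ok := cache.Get("user:1")
	fmt.Println(v, ok) // prints "alice true"
}
```

The design is safe for concurrent use, but every `Get` and `Set` passes through the same lock, which is why it tends to hold up only at low concurrency.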

In Dgraph what we had done was we took the LRU implementation by groupcache, run by Brad Fitzpatrick of the Memcached team and obviously the Go team. It was obviously a very nice implementation of LRU cache that we picked up. We put a lock around it and we started using it, and we knew that we’d have to optimize it at some point, but we did not realize how bad it was.

At some point I was looking at a particular query - this was one year after implementing the system - and we realized that if we were to remove the cache, our queries would improve by five to ten times… Even a 30% query improvement is a good day for an engineer, but when you increase it ten times, that’s just incredible. So we immediately removed the cache and we started to look around to see what we could use. That’s when the whole idea for ristretto started.

Practical AI #59

Flying high with AI drone racing at AlphaPilot

Chris and Daniel talk with Keith Lynn, AlphaPilot Program Manager at Lockheed Martin. AlphaPilot is an open innovation challenge, developing artificial intelligence for high-speed racing drones, created through a partnership between Lockheed Martin and The Drone Racing League (DRL).

AlphaPilot challenged university teams from around the world to design AI capable of flying a drone without any human intervention or navigational pre-programming. Autonomous drones will race head-to-head through complex, three-dimensional tracks in DRL’s new Artificial Intelligence Robotic Racing (AIRR) Circuit. The winning team could win up to $2 million in prizes.

Keith shares the incredible story of how AlphaPilot got started, just prior to its debut race in Orlando, which will be broadcast on NBC Sports.

Matched from the episode's transcript 👇

Chris Benson: Yeah, commenting on it - and I actually have the Xavier listed here in terms of specs… This is the same GPU computer that is used in autonomous vehicles. It’s a 512-Core Volta GPU with Tensor Cores, 8-Core ARM with a 64-Bit CPU, 16 GB of 256-bit LPDDR4x memory, 32 GB of flash storage… It’s quite a computer, without going through the whole thing. I know that when I saw you last at the event, you were putting a pretty serious computer on these drones, and I was pretty impressed with the performance even so.

Practical AI #56

Worlds are colliding - AI and HPC

In this very special fully-connected episode of Practical AI, Daniel interviews Chris. They discuss High Performance Computing (HPC) and how it is colliding with the world of AI. Chris explains how HPC differs from cloud/on-prem infrastructure, and he highlights some of the challenges of an HPC-based AI strategy.

Matched from the episode's transcript 👇

Chris Benson: Sure. On the CPU side, a large use case can consume tens of thousands of cores to run simulations in tremendous detail, and be able to do all the parallel computation that’s required of that. People from our side, with our bias, tend to think “Oh, well that’s gonna be eclipsed, and the world goes GPU”, but there are many use cases that are not necessarily specifically optimized for GPU. We are seeing some cross-over there, and there are companies out there that are in the GPU space, NVIDIA being one of them, that are basically trying to pull traditional CPU-based use cases over into the GPU world. You have to do that assessment of what that means to your organization and the projects that you’re involved in. It’s kind of funny - so you can get to that level on the CPU side.

On the GPU side, it’s interesting that as HPC is really addressing the artificial intelligence and machine learning space at this point, then you get into a situation where you can almost consume - for really sophisticated training techniques - a tremendous amount of computation. So it’s really not always about just “I have X number of GPUs. Okay, that’s my requirement.” Going forward, in training we have concepts like mass hyperparameter exploration, where you’re trying to find optimal sets of hyperparameters for your AI model, and you’re training them in parallel, varying hyperparameters, so that you can find the various performance gains and optimizations to do that. That’s one way where you essentially can absorb all the compute that’s available to you. And then there are other things like deep reinforcement learning; we’ll get into things like large-scale self-play, where you are allowing the agents to run, and going through that training cycle of deep reinforcement learning also in parallel to speed up, and to also find different avenues through that.

And then at the end of the day, those are kind of served by auto-scaling anyway, so it’s less of “Well, I have X number of GPUs and I’m gonna run with that over a given period of time. That meets my requirements”, and more like “If we’re gonna do something like this, how much capacity do I have right now?” It may be that in my prior effort, with a slightly different approach, I only needed a certain number of GPUs, but if I’m for instance gonna jump in to doing this mass-scale hyperparameter exploration, I might try to suck in every GPU I can to get through that, so that I can get through it in minutes or hours, instead of days or weeks or months. So I guess the elasticity necessary in your high-performance computing cluster becomes very important, so you have to have strategies that can accommodate those types of use cases.

JS Party #88

Droppin' insider logic bombs

Jerod, Feross, and Nick discuss the latest npm security fiasco, opine on the strengths and weaknesses of spreadsheets, explain CORS like they’re 5 (sorta), and give shout outs to deserving purveyors of fine software.

Matched from the episode's transcript 👇

Nick Nisi: Alright… So CORS, like apple cores… No, CORS as in Cross-Origin Resource Sharing is the topic I'm gonna try to explain to a five-year-old, although I probably won't be able to go that far with it, because you do have to know a little bit about how that network request can be made from websites, and things…

Backstage #5

The Pro Stand costs more than my first car

Jerod, Adam, and Nick get together mere minutes after Apple’s 2019 WWDC keynote to talk about all the news and announcements. Will we be buying the new Mac Pro? What about that drool-worthy 6k retina display? Will iOS’s dark mode deliver where Mojave’s hasn’t? Expect all that and at least 2 bad puns in this episode of Backstage.

Matched from the episode's transcript 👇

Nick Nisi: I’m looking at the tech specs, and I don’t know if they mentioned this in the talk or not, but it does start at 32 gigs of RAM, and it goes all the way up to 1.5 terabytes of RAM, and 28 cores.

Practical AI #40

Deep Reinforcement Learning

While attending the NVIDIA GPU Technology Conference in Silicon Valley, Chris met up with Adam Stooke, a speaker and PhD student at UC Berkeley who is doing groundbreaking work in large-scale deep reinforcement learning and robotics. Adam took Chris on a tour of deep reinforcement learning - explaining what it is, how it works, and why it’s one of the hottest technologies in artificial intelligence!

Matched from the episode's transcript 👇

Adam Stooke: Again, a lot of the work at the beginning was just scaling out the reinforcement learning itself, taking existing algorithms and discovering that they can be scaled up to run on the entire system, so that we could use all eight GPUs and all 40 CPU cores within a DGX-1 to learn a single Atari game, and get basically linear speed-ups with that. So instead of taking 10 or 15 hours to master Pong, we’re getting it to like four minutes or so…

Practical AI #39

Making the world a better place at the AI for Good Foundation

Longtime listeners know that we’re always advocating for ‘AI for good’, but this week we have taken it to a whole new level. We had the privilege of chatting with James Hodson, Director of the AI for Good Foundation, about ways they have used artificial intelligence to positively-impact the world - from food production to climate change. James inspired us to find our own ways to use AI for good, and we challenge our listeners to get out there and do some good!

Matched from the episode's transcript 👇

James Hodson: We’ve got two prongs on this particular area right now. The first is that we are organizing what we’re calling The Earth Day Summit in Anchorage, Alaska, in August. This will bring together machine learning researchers, machine learning practitioners, scientists who work with the IPCC, scientists from NSF, and from various other large international or national grant-making organizations that work in this area. That’s the first time that we’re going to see an organized and large-scale set of conversations exactly on the topic of how machine learning can help with the various climate change-related challenges that we face.

[32:09] Now, many people don’t realize, but most datasets used by the IPCC are tiny. They’re on the order of tens of samples, because you can’t take more than tens of samples of ice cores and you can’t look at testing gas concentrations in more than 10 or 20 different locations globally without it becoming cost-prohibitive. So many of the problems aren’t big data problems, but if we’re talking about practical AI, there’s no reason why machine learning has to be a big data problem. This is a new myth that has been generated. We have methods for dealing with small data too, and some problems converge faster than others, and some problems require less data in order to achieve the same performance, depending upon how you go about finding solutions.

So we’re all about starting those kinds of conversations, and not hiding behind the stereotype of machine learning as being large convolutional neural nets with millions of samples.

Changelog Interviews #330

source{d} turns code into actionable insights

Adam caught up with Francesc Campoy at KubeCon + CloudNativeCon 2018 in Seattle, WA to talk about the work he’s doing at Source{d} to apply Machine Learning to source code, and turn that codebase into actionable insights. It’s a movement they’re driving called Machine Learning on Code. They talked through their open source products, how they work, what types of insights can be gained, and they also talked through the code analysis Francesc did on the Kubernetes code base. This is as close as you get to the bleeding edge and we’re very interested to see where this goes.

Matched from the episode's transcript 👇

Francesc Campoy: Yeah. I mean, it’s pretty large, because it’s big data. The analysis that I did for the Kubernetes codebase - I was running on an instance in Google Cloud platform, with (I think it was) 96 cores. So you know, a pretty large instance… And yeah, the analysis of counting all of the languages, for all of the commits over time took around ten minutes. It’s not that bad, actually… But if you’re trying to do this for a very large thing, 96 cores is gonna be maybe enough at the beginning. But eventually you’ll want to have it distributed, and that’s where basically we’re saying, you know, once you need more than one node, then it’s enterprise edition and we should talk… Because the whole idea is that we wanna give as much as possible to the open source community, and especially the engine can be a really powerful way to obtain data for all of the research part of machine learning. There’s a lot of people doing research, and they need datasets. The fact that they will be able to generate those datasets by running SQL queries that they already know very well - it’s super-powerful. So we wanna make sure that they’ve got access to that.

[12:14] But for larger companies that wanna do analysis – and the interesting thing is that those metrics that we came out with, you can tweak them… And we are going to come up with a catalog of the kind of metrics that you should be figuring out and looking at. For instance, if you’re saying “I’m gonna be moving on cloud-native… Cloud Native Computing Foundation, I’m gonna go cloud-native.” Cool. What are the things that you should be looking at? Well, you should have a Dockerfile, you should have continuous integration, you should have continuous deployment. All of these things nowadays are in the source code, so we can analyze those things and give you a little bit of an idea of, if you’re going towards being cloud-native, how far away are you from getting there, and also, what are the things that you should be changing, what piece of the source code should be worked on in order to get there.

That is super-useful, because basically the whole idea is that it brings visibility to processes like going cloud-native, or adopting inner source, or adopting DevOps… Lots of people talk about “Oh, we’re gonna be doing DevOps.” What does that mean, right?