David Sweet, author of “Tuning Up: From A/B testing to Bayesian optimization”, introduces Dan and Chris to system tuning, taking them from A/B testing to response surface methodology, contextual bandits, and finally Bayesian optimization. Along the way, we get fascinating insights into recommender systems and high-frequency trading!
Matched from the episode's transcript 👇
David Sweet: We can talk about them… Maybe I’ll try to do them from most interesting to least, but… One thing - there’s this interesting cultural dynamic in finance – in trading specifically. I’ll even narrow it down more, to quantitative trading, where people, especially when they’re new to the field, wanna come in and try the latest and greatest algorithms and ideas, and everything they’ve learned recently in school, from papers and whatnot, and make some money. Build the magic machine that makes a ton of money.
On the other side, you’ve got people who’ve been doing it for a while, usually [unintelligible 00:14:02.12] who roll their eyes at every new thing, like “Ahh… That’s not gonna work. Neural networks don’t work. SVMs don’t work.” And sometimes they’re right, sometimes they’re wrong… I think if you say something’s not gonna work, you’ll usually be right, but you just won’t be productive… So it’s one of the unfortunate aspects of the distribution of quality of new ideas in engineering.
So what I find is - I’ve seen people try, or I’ve been one of those who has tried, all kinds of things. Basically, if you wanted to just randomly throw out ideas [unintelligible 00:14:34.14] And some of the things stick. Some people figure out how to get things to work.
The big problems with financial data are that the signal-to-noise ratio is very low, and that the signals aren’t just small - they’re competed away. The act of going and trading on signals which your competitors are seeing as well squashes those signals. So it creates this non-stationarity where over time your strategies become less and less tradable, sometimes very quickly. So you constantly have to adapt and look for new ways to predict or to trade.
One thing that – you mentioned reinforcement learning, and that brought to mind… I don’t think reinforcement learning is ready for you to just turn it on and get a usable answer out of in finance. I haven’t seen that. And I say that only – I say it because it’s hard. I feel like it’s still cutting-edge for solving this kind of problem. I see a lot of promise in offline reinforcement learning, what’s been going on over the past year or so… It’s just amazing, and it’s very much in line with… It’s like a machine learning replacement - or an AI replacement, I’ll say - for old-school simulation optimization; like, how do you make that more automated, or more autonomous, or hyper-automated, or get to that next level of automation? So yeah, I see a lot of promise, but I haven’t seen people just kind of taking that out of the box and making it work.
[15:58] A contextual bandit, on the other hand, which is a limited subset of reinforcement learning - not only do I think that that’s directly useful, but I think people in finance have been doing it ad hoc for a long time anyway… You know, if not in the most super-efficient way it can be done, like people understand it these days, I think, since the beginning of my [unintelligible 00:16:17.12] doing things that kind of look to me like a contextual bandit.
What makes that easier than a full reinforcement learning problem is that you’re only predicting the immediate reward, so you don’t have to worry about your decision now affecting the state of the world for your decision later, and then having this compounding of state changes based on previous decisions. That’s a more IID sample, so to speak, to build your model with.
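The immediate-reward structure he describes can be sketched as a simple epsilon-greedy contextual bandit. This is a hypothetical illustration, not David Sweet’s actual method: each arm keeps its own ridge-regression model of the immediate reward, and because each decision only affects its own reward, every (context, action, reward) triple can be treated as a roughly IID training sample.

```python
# Minimal epsilon-greedy contextual bandit sketch (illustrative only).
# Each arm fits a linear model of the *immediate* reward; no state
# transitions are modeled, which is what makes this simpler than full RL.
import numpy as np

rng = np.random.default_rng(0)
N_ARMS, DIM, EPSILON = 3, 4, 0.1

# Hidden per-arm reward weights the bandit has to discover (synthetic data).
true_w = rng.normal(size=(N_ARMS, DIM))

# Per-arm ridge-regression statistics: A = X^T X + I, b = X^T y.
A = np.stack([np.eye(DIM) for _ in range(N_ARMS)])
b = np.zeros((N_ARMS, DIM))

def choose(context):
    """Epsilon-greedy: usually pick the arm whose model predicts the
    highest immediate reward; occasionally explore at random."""
    if rng.random() < EPSILON:
        return int(rng.integers(N_ARMS))
    preds = [np.linalg.solve(A[a], b[a]) @ context for a in range(N_ARMS)]
    return int(np.argmax(preds))

def update(arm, context, reward):
    """Rank-one update of the chosen arm's regression statistics."""
    A[arm] += np.outer(context, context)
    b[arm] += reward * context

for t in range(2000):
    x = rng.normal(size=DIM)
    arm = choose(x)
    reward = true_w[arm] @ x + 0.1 * rng.normal()  # noisy immediate reward
    update(arm, x, reward)

# The learned per-arm weights should approach the hidden true weights.
learned = np.stack([np.linalg.solve(A[a], b[a]) for a in range(N_ARMS)])
print(np.max(np.abs(learned - true_w)))
```

Because the reward depends only on the current context and action, the per-arm regressions stay well-specified even though the greedy policy biases which contexts each arm sees; that is the “more IID sample” property that a full RL problem, with compounding state changes, would lose.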