@Alexandre_Mutel
Director C#/.NET Tech Group at Unity, OSS, lang/compilers, GPU/sound, architecture 🏎️ Microsoft MVP, ex-demoscene PC/Amiga 🎆 Veggie 🌿, opinions are my own.
2023
- 10x Performance with SIMD Vectorized Code in C#/.NET - Jul 9 - C#, .NET, x86, assembler - Use your CPU at its full width!
2018
- Generate automatically async/await code from sync code with Roslyn - Dec 26 - C#, .NET, Roslyn
- Writing a Managed JIT in C# with CoreCLR - Apr 12 - C#, .NET, CoreCLR
- Porting the Unity Engine to .NET CoreCLR - Apr 6 - C#, .NET, CoreCLR, Unity
- Productivity with ReSharper - Mar 9 - Visual Studio, Visual Studio 2015, Roslyn
2009
- Random float number generator using x86 ASM code optimized in size - Oct 25 - assembler, x86
@Bishal Santra
Research Engineer @microsoftResearch India | Working on Language Modeling for Retrieval | IIT KGP
- Aug 12, 2023
  Explaining Issues with Channel Method in LLM Prompt-based Classification
- Feb 11, 2023
  Prove that the set of algebraic numbers are countable (using primes)
- Feb 10, 2023
  Is ∑_i=1ⁿ 1/n divergent?
- Oct 7, 2021
  How to Connect through an SSH Jump Server (For CNeRG project students)
- Oct 26, 2019
  Transformer Language Models and Pretraining
- Aug 31, 2019
  English to Hindi Transliteration using Seq2Seq Model
- Aug 31, 2019
  Deep Sentiment Analysis
- Jul 13, 2019
  Text Classification using Naive Bayes Method
- Jul 8, 2019
  Training a Language Model with a Xtra-Small Transformer (Transformer-XS)
- Jul 7, 2019
  Simple Reddit Dialogue Preprocessor
- Jul 3, 2019
  How I created this site using Jekyll?
- Jan 12, 2017
  Generating Gamma Random Variable in CUDA in Parallel
- Aug 6, 2016
  Minimum Mean Squared Error (MMSE) Estimator
- Mar 14, 2016
  DecycledJSON - Circular reference breakers for JSON
Kristoffer Carlsson
Software engineer, Julia Computing
SIMD and SIMD-intrinsics in Julia Tue, Nov 13, 2018
Case study: Improving performance of a code written in Matlab style Mon, Dec 26, 2016
Stathis Kamperis
I am a radiation oncologist and physicist. I like to build bridges between different scientific disciplines (medicine, physics, informatics).
Danielle Navarro
Hi there! I’m Danielle Navarro. I’m a data scientist, generative artist, and a recovering academic living in Sydney with my two kids and a Netflix subscription. Once upon a time I was a mathematical psychologist. After that I was a developer advocate and occasional software engineer. I’ve sometimes been accused of being a statistician.
Miles Cranmer
Hi there! I’m Miles Cranmer.
DavisVaughan
Hi there! I’m DavisVaughan.
Jon Shlens
Hi there! I’m Jon Shlens.
Yihui Xie
Hi there! I’m Yihui Xie. I’m a Freelancer (open source programmer, contractor, blogger, and writer)
ThomasLumley
on Mastodon and Bluesky
Prof Richard Xu
Hi there! I’m Prof Richard Xu.
I am a Professor at the Department of Mathematics, Hong Kong Baptist University (HKBU) 香港浸会大学数学系教授
Alexander Fischer
Hi there! I’m Alexander Fischer.
Data Scientist @trivago
Ross Wightman
Hi there! I’m Ross Wightman.
Computer Vision @huggingface. Always learning, constantly curious. Building ML/AI systems, watching loss curves.
Steven G. Johnson
Hi there! I’m Steven G. Johnson.
Professor of Applied Mathematics and Physics, Massachusetts Institute of Technology
Yixuan Qiu
Hi there! I’m Yixuan Qiu.
Currently an associate professor in the School of Statistics and Management at Shanghai University of Finance and Economics (SUFE).
Roger Koenker
Hi there! I’m Roger Koenker.
Thomas Stringer
Hi there! I’m Thomas Stringer.
Martin Evans
Hi there! I’m Martin Evans.
http://martindevans.me
Cédric Luthi
Hi there! I’m Cédric Luthi.
Mark Heath
Hi there! I’m Mark Heath.
Elasticsearch
Elasticsearch. The heart of the Elastic Stack
Philipp Wagner
Hi there! I’m Philipp Wagner.
Danilo Poccia
Hi there! I’m Danilo Poccia.
KerasHub: Multi-framework Pretrained Models
Pretrained model hub for Keras 3.
Mohammad Elsheimy
Yoshifumi Kawai
Dapr
ABP
ABP offers an opinionated architecture to build enterprise software solutions with best practices on top of the .NET and the ASP.NET Core platforms.
OpenTelemetry - CNCF
Hi there! I’m OpenTelemetry - CNCF.
Safia Abdalla
Hi there! I’m Safia Abdalla.
Anthony Sneed
Conrad Ludgate
Conrad Ludgate.
SQL Join
SQL Join.
Vincent D. Warmerdam
Single instruction, multiple data (SIMD)
Single instruction, multiple data (SIMD).
SciML Open Source Scientific Machine Learning
Open source software for scientific machine learning
Linear Regression in Machine learning
Sam Grey Danus
Yunjey Choi
Paul Berg
Benoît Legat
Dask
Parallel computing with task scheduling
The Python Pickle Module
Andriy Burkov
Hi there! I’m Andriy Burkov.
.NET 8 container workshop
.NET 8 container workshop.
Martin Krasser
Martin Krasser.
Simon Willison
Hi there! I’m Simon Willison.
Ahmet Alp Balkan
Ahmet Alp Balkan.
Anish Athalye
Anish Athalye.
Xuan-Son Nguyen
These are my PRs
Alex Chi Z.
Alex Chi Z.
Rotation related problems
Rotation-related problems: quaternions, Rodrigues’ rotation formula
CUDA Programming Model
Mat Leonard
Mat Leonard.
Oriol Nieto
Oriol Nieto.
Guillaume Guy
DB
DB.
Books
Hi there! I’m Books.
Span<T> and Pipelines
Reshama Shaikh (@reshamas)
Hi there! I’m Reshama Shaikh (@reshamas).
Su Yang
Asankhaya Sharma
Max Liani
Microsoft MVP
Anthony Shaw(@tonybaloney)
Ethan Harris(@ethanwharris)
Jirka Borovec(@Borda)
David Pine(@IEvangelist)
Friedrich von Never(@ForNeVeR)
Adam Sitnik(@adamsitnik)
Ash Vardanian(@ashvardanian)
Jimmy Lefevre(@JimmyLefevre)
Shay Rojansky(@roji)
Tania Allard(@trallard)
Caleb-robinson - Understanding intersection-over-union
August 13, 2023 Solving a game called 'ball sort puzzle'
October 02, 2019 Generating elementary cellular automata with Python
October 22, 2018 How to reproduce ImageNet validation results
October 22, 2018 GitHub: How to reproduce ImageNet validation results
Dr Pasquale Minervini - Some notes on Gaussian Fields and Label Propagation

Peter Melchior -

Scarlet2 – Thoughts for a major redesign
Bayesian inference three ways Running MCMC, Hamiltonian MC, and simulation-based inference with a few lines of code

Nicolas P. Rougier - From Python to Numpy
There are already a fair number of books about Numpy (see Bibliography) and a legitimate question is to wonder if another book is really necessary. As you may have guessed by reading these lines, my personal answer is yes, mostly because I think there is room for a different approach concentrating on the migration from Python to Numpy through vectorization.
Heiner Küttler -
Feb 19, 2023 The chain rule, Jacobians, autograd, and shapes
Apr 14, 2021 Gompertz, annuities, and special functions
Mar 23, 2021 πs, deaths, and statistics
Dec 29, 2020 Well-definedness in measure theory
Sep 8, 2020 More on linear regression: Capital asset pricing models
Feb 19, 2020 On linear regression
Aug 19, 2019 Annuity loans
Dec 25, 2016 Dylanchords
Aug 21, 2016 In every beginning there is a delusion
Felix-Altenberger - ML Engineer at ZenML. Posting about ML, MLOps, Computer Vision.
2021, Mar 30 A Guide for Building GANs - 10 Tips and Tricks
Feb 19, 2023 An Analysis of ICP Variants
Feb 19, 2023 3D Reconstruction with Differentiable ICP
Armin Ronacher (mitsuhiko)
2018, Mar 18 You can't Rust that Some tips for how to be more productive in Rust by avoiding situations you cannot solve in Rust.
Oct 19, 2016 I don't understand Python's Asyncio
Nov 18, 2015 Python's Hidden Regular Expression Gems
Peter Goldsborough
2018, Feb 4 A Promenade of PyTorch.
Jan 20, 2018 Writing a Microservice in Rust
Aayush Agrawal
2022, Oct 12 Model calibration for classification tasks using Python
Jan 20, 2024 ML Interview Preparation Guide (Draft)
ezyang's blog
PyTorch internals(May 16, 2019)
Vincent_Qin's blog
📝Note: Python zip()
Light Field Depth Estimation( 16/05/2018)
📝Note: 5 seconds to train NeRF, NVIDIA Instant NeRF test( 01-05-2022)
📝Note: SLAM FAQ (IV): Solve ICP and use SVD decomposition to get the rotation matrix( 18-08-2019)
Andrew Tulloch—Machine Learning, Statistics, Systems
Improving PyTorch inference performance on GPUs with a few simple tricks( 3/10/2021)
The LASSO Estimator( 13/05/2014)
Cambridge Part III Mathematics Notes( 27/10/2014)
Michael Clarke's blog
Deep Linear Models( 10/10/2022)
Double Descent( 13/11/2021)
Hugo Bowne-Anderson: data scientist - educator - writer - podcaster
2020, Apr 14 How to do Bayesian statistical modelling using numpy and PyMC3.
2020, Apr 14 GitHub repository: How to do Bayesian statistical modelling using numpy and PyMC3.
Sanjeev Sharma@sanjeevs_iitr, Founder & CEO, Swaayatt Robot, Deep Eigen
2024, Aug 24 Projective 3-Point (P3P) algorithm plays a crucial role in understanding the Visual Odometry and typical SLAM algorithmic pipelines.
Greg Brockman President & Co-Founder @OpenAI
2019, Jul 30 How I became a machine learning practitioner
2025, Jan 01 If you are starting ML in 2025, read this blog by @gdb. I believe it will help you out. Link: TwitterID: goyal__pramod
Grant Slatton
Formerly built the world's fastest filesystem at AWS, now the fastest spreadsheet at rowzero.com
Binary IQ — A model of LLM capability
Lightweight property-based testing at Row Zero — How we verify correctness
Rust Macros: Zero to Hero — A comprehensive guide on Rust macros
Algorithms we develop software by — Pathfinding applied to the software solution domain
Building Filesystems — High level ideas in filesystem design
Quasirandom sequences — Cool method to generate non-clumping random points
Kanaka Rajan I'm an Entrepreneur | Researcher
2024, Oct 17 research. Resources.
Sewon Min Incoming faculty @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp.
John Parkhill ML, director of machine learning Terray Therapeutics (https://x.com/Terray_Tx).
May 30, 2021
Pricing Options with TorchSDE
Apr 30, 2021
Solving multidimensional PDEs in pytorch
Apr 7, 2021
Using simple mean-reversion to remove carry from a VIX futures position
Mar 6, 2021
Simply Extracting information out of your Stochastic Series
Jan 6, 2021
Copulas made simple with Pytorch Distributions
Dec 5, 2020
How to Size Bets. The Kelly Criterion in PyTorch
Sep 21, 2020
Some Robot Art
Sep 5, 2020
Quaternion Averaging in Pytorch
Christian S. Perone / Machine Learning (@polymtl/@UMontreal)
- The geometry of data: the missing metric tensor and the Stein score [Part II]
- Torch Titan distributed training code analysis
- Memory-mapped CPU tensor between Torch, Numpy, Jax and TensorFlow
- Generalisation, Kant’s schematism and Borges’ Funes el memorioso – Part I
- PyTorch 2 Internals – Talk
- Thoughts on Riemannian metrics and its connection with diffusion/score matching [Part I]
- Large language model data pipelines and Common Crawl (WARC/WAT/WET)
- Feste: composing NLP tasks with automatic parallelization and batching
- Couple of recent publications in uncertainty estimation and autonomous vehicles
- [pt-br] Dados das enchentes no Rio Grande do Sul (RS) em 2024
- Tutorial on using LLVM to JIT PyTorch fx graphs to native code (x86/arm/risc-v/w...)
- Arduino WAN, Helium network and cryptographic co-processor
- Episuite: epidemiology in Python
Jonathan Frankle
Chief AI Scientist at Databricks. Founding team at MosaicML. MIT/Princeton alum. Lottery ticket enthusiast. Working on data intelligence.
Nov 26, 2024
Reposted by Jonathan Frankle: Ofir Press @ofirpress.bsky.social · I wrote some thoughts on how to build good LM benchmarks: ofir.io/How-to-Build...
Charlie Marsh
Building Astral: high-performance tools for Python, starting with Ruff.

In the past: Staff software engineer at Spring Discovery, senior engineer at Khan Academy, and Computer Science major at Princeton.

These days, I write on Notion.

Check out some of my public projects:
Hi, I'm Charlie Marsh.
I'm building high-performance developer tools for Python, starting with Ruff, an extremely fast Python linter written in Rust.
I was most recently a staff software engineer at Spring Discovery. Before that, I was a senior software engineer at Khan Academy.
This is a collection of notes and blog posts I’ve written on Notion:
🐍Using Mypy in production at Spring
🌐What’s WebAssembly?
🛠️Python tooling could be much, much faster
🤖Building large language model-powered applications
☁️Isolates, microVMs, and WebAssembly
⚡Ruff: The First 200 Releases
You can find me on Twitter.
For older posts and projects, check out my personal site.
kevin frans website v5
Hey, I'm Kevin. I am a PhD student at BAIR advised by Pieter Abbeel and Sergey Levine. I did my B.S. and M.Eng at MIT with Phillip Isola. I am interested in deep reinforcement learning, unsupervised learning, and AI-based creative tools. I also lead engineering at ParagraphAI. I have spent time at Cross Labs, Sizigi, Autodesk Research, and OpenAI. In my free time, I like to design and build video games.
19 Dec 2023
Small-Research: Tanh Activations with DDPG
19 Dec 2023
Small-Research: Policy Extraction in IQL
8 Sep 2023
Successor Representations Explained
1 Aug 2023
Deriving the KL divergence loss in variational autoencoders
15 May 2022
For AGI, we need better tasks. For better tasks, we need open-endedness. (ALOE 2022 Notes)
6 Dec 2021
A Mathematical Definition of Interestingness
12 Nov 2021
To extract information from language models, optimize for causal response
2 Nov 2021
Data digesters, ML^2, Interestingness
28 Jun 2021
CLIPDraw: Exploring Text-to-Drawing Synthesis
3 Apr 2021
StampCA: Growing Emoji with Conditional Neural Cellular Automata
2 Dec 2020
Quality Diversity: Evolving Ocean Creatures
3 Apr 2021
Open-Endedness 3: Multicell World
22 Oct 2020
Open-Endedness 2: Bitwise Chemicals
15 Oct 2020
Open-Endedness 1: Cellworld
27 Nov 2019
Omakase 5: Bullet Dance
21 Jul 2019
Omakase 1: Ropeman
8 Apr 2019
Linking C++ and Python with Boost (Anaconda)
12 Jul 2018
RAIN Project: Evolution of the Game Development Dream
26 Oct 2017
[Link] Learning a Hierarchy
28 Feb 2017
Deepcolor: Automatic Coloring and Shading of Manga-Style Lineart
Alex Nichol
AI researcher, hobby web developer, math geek. Constantly learning.
- Honeycrisp: An Apple-First Deep Learning Framework(11/29/2024)
- Sharing Streaming Services Across Households(01/01/2024)
- Representing 3D Models as Decision Trees(05/20/2023)
- Large-Scale Vehicle Classification(12/31/2022)
- Data and Machines(04/12/2021)
- VQ-DRAW: A New Generative Model(03/04/2020)
- Research Projects That Didn’t Pan Out(01/18/2020)
- Competing in the Obstacle Tower Challenge(07/24/2019)
- Prierarchy: Implicit Hierarchies(04/03/2019)
- Solving murder with Go(12/24/2018)
- What I Don’t Know(12/23/2017)
- Adversarial Train/Test Splits(10/31/2017)
- Decision Trees as RL Policies(08/30/2017)
- Keeping Tabs On All My Neural Networks(07/04/2017)
- Why I’m Remaking OpenAI Universe(06/11/2017)
- The Meta-Learning Quest: Part 1(04/15/2017)
- Slice Aliasing Is Nicer Than You Realize(04/05/2017)
- The Bug That Wasted a Month of GPU Time(03/08/2017)
- Random Fun with Linear SVMs(03/01/2017)
- Can Neural Networks Learn to Spell?(02/14/2017)
- Ancient Philosophy as a Classification Problem(02/12/2017)

Understanding intersection-over-union

Intersection-over-union (IoU), also known as the Jaccard index, is a commonly used measure of how accurate a proposed image segmentation is compared to a known/ground-truth segmentation. In segmentation tasks the IoU is preferred over accuracy because it is less affected by the class imbalances that are inherent in foreground/background segmentation tasks. As an example, if a ground truth image is made up of 90% background pixels, a proposed segmentation that classifies all pixels as background will have an accuracy of 90% whereas it would have an IoU of 0%.
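
The 90% background example can be checked with a minimal sketch in plain Python. The helper names `accuracy` and `iou`, the 100-pixel toy image, and the convention that two empty masks score 1.0 are assumptions made for illustration, not code from the post.

```python
def accuracy(pred, truth):
    """Fraction of pixels whose predicted label matches the ground truth."""
    matches = sum(p == t for p, t in zip(pred, truth))
    return matches / len(truth)


def iou(pred, truth, positive=1):
    """Intersection-over-union (Jaccard index) for the `positive` class."""
    pairs = list(zip(pred, truth))
    intersection = sum(p == positive and t == positive for p, t in pairs)
    union = sum(p == positive or t == positive for p, t in pairs)
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


# Toy image flattened to 100 pixels: 90 background (0) and 10 foreground (1).
truth = [0] * 90 + [1] * 10
all_background = [0] * 100  # a segmentation that predicts background everywhere

acc = accuracy(all_background, truth)
fg_iou = iou(all_background, truth, positive=1)
print(f"accuracy = {acc:.0%}, foreground IoU = {fg_iou:.0%}")
```

The all-background prediction matches 90 of 100 pixels but shares no foreground pixel with the ground truth, so the foreground intersection is empty while the union still holds the 10 true foreground pixels.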

1 Jan 2017 • on machine learning, semi-supervised learning

Solving a game called 'ball sort puzzle'

9 minute read

Published: August 13, 2023

Generating elementary cellular automata with Python

4 minute read

Published: October 02, 2019

How to reproduce ImageNet validation results

4 minute read

Published: October 22, 2018

imagenet_validation Public

How to reproduce ImageNet validation results

Jupyter Notebook

The code in this repository can be used to reproduce the ImageNet validation results for Keras pretrained models. A blog post describing this process in more detail is here.

Code

1. Preprocess ImageNet validation set - converts the raw ILSVRC2012 validation images/labels into NumPy arrays (.npy files) that can be used “as is” with pre-trained Keras models
2. Benchmark Keras pretrained models on ImageNet.ipynb - uses the preprocessed data and the VGG19 pre-trained network to reproduce the Top-1 and Top-5 accuracy reported in the Keras documentation.

Some notes on Gaussian Fields and Label Propagation

Propagation as a Cost Minimization Problem

Peter Melchior

Scarlet2 – Thoughts for a major redesign Astronomical source modeling and separation, all new and shiny
Bayesian inference three ways Running MCMC, Hamiltonian MC, and simulation-based inference with a few lines of code

From Python to Numpy

This is a collection of numpy exercises from numpy mailing list, stack overflow, and numpy documentation. I've also created some problems myself to reach the 100 limit. The goal of this collection is to offer a quick reference for both old and new users but also to provide a set of exercises for those who teach. For extended exercises, make sure to read From Python to NumPy.

There are already a fair number of books about Numpy (see Bibliography) and a legitimate question is to wonder if another book is really necessary. As you may have guessed by reading these lines, my personal answer is yes, mostly because I think there is room for a different approach concentrating on the migration from Python to Numpy through vectorization. There are a lot of techniques that you don't find in books and such techniques are mostly learned through experience. The goal of this book is to explain some of these techniques and to provide an opportunity for making this experience in the process.

Website: http://www.labri.fr/perso/nrougier/from-python-to-numpy

Introduction

Simple Example
Readability vs Speed

Anatomy of an Array

Introduction
Memory Layout
Views and Copies
Conclusion

Code Vectorization

Introduction
Uniform Vectorization
Temporal Vectorization
Spatial Vectorization
Conclusion

Problem Vectorization

Introduction
Path Finding
Fluid Dynamics
Blue Noise Sampling
Conclusion

Custom Vectorization

Introduction
Typed List
Memory Aware Array
Conclusion

Beyond Numpy

Back to Python
Numpy & co
Scipy & co
Conclusion

Quick References

Data Type
Creation
Indexing
Reshaping
Broadcasting

Bibliography

Tutorials
Articles
Books

Table of Contents

I am Heinrich 'Heiner' Küttler. I am a member of the technical team at xAI. Previously, I was the LLM Infra Lead and a member of the founding team at Inflection AI. We built LLMs. Before this, I was a Research Engineering Manager at Meta AI Research in London, leading Reinforcement Learning engineering across EMEA, and before that I was a Senior Research Engineer and Team Lead at DeepMind, working on projects like DMLab, StarCraft, and AGI. I also once was a Technical Solutions Consultant at Google in London. I received my PhD in Mathematical Physics from LMU Munich in 2014.


πs, deaths, and statistics

PDFs, CDFs, and Hazard Functions

If you have taken a probability or statistics course, you probably (ha!) know about probability density functions (pdfs). A pdf is a positive function that we use as a density, and to make it a probability density it needs to integrate to one. If $f$ is a pdf and $X$ is a random variable with that distribution then

$$P(a < X \leq b) = \int_{a}^{b} f(x) \, dx,$$

An Analysis of ICP Variants

Over the years, various ICP modifications have been proposed. Now, which one should you use?...

2021, Mar 30 — 6 minute read

A Guide for Building GANs - 10 Tips and Tricks

After having focused on GANs exclusively for the last year and a half, I wanted...

2020, Dec 30 — 7 minute read

3D Reconstruction with Differentiable ICP

# Python # PyTorch # ML # Graphics # 3D Reconstruction # ICP # WithCode

2021, Mar 30 — 9 minute read

Duration: July 2020 to March 2021 (9 months)
Team: Me and supervising prof.
My Responsibilities: Research, design and implementation of differentiable ICP in Pytorch, ML model training and evaluation
Source Code: https://github.com/fa9r/DiffICP

Differentiable ICP

The ICP algorithm consists of the following five steps: source point selection, correspondence search, correspondence weighting, correspondence rejection, and the minimization of an error metric. Source point selection and correspondence weighting are by default differentiable, so it is the remaining three steps that we need to explore in more detail.

Differentiable Correspondence Finding

Standard ICP correspondences are found by searching the nearest neighbor of each source point within the target point cloud, which can be formulated as follows:

The problem here is that the argmin operation is not properly differentiable, since its derivative is everywhere either 0 or undefined. There exist a variety of approximate methods, but they are similarly based on concrete selections and are, therefore, not differentiable either.

Fortunately, a soft relaxation can be formulated by expressing correspondence points as linear combinations of all target points with weights calculated as the softmax over negative distances:
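
A minimal sketch of the two correspondence rules, in plain Python for illustration only; the project itself is described as a PyTorch implementation, and this is not its code. The function names and the `temperature` parameter are assumptions introduced here.

```python
import math


def nearest_neighbor(source, targets):
    """Hard correspondence: the target closest to `source` (an argmin)."""
    return min(targets, key=lambda t: math.dist(source, t))


def soft_correspondence(source, targets, temperature=1.0):
    """Soft correspondence: a weighted sum of all targets.

    The weights are a softmax over negative distances. `temperature` is a
    hypothetical knob: small values push the result toward the argmin.
    """
    logits = [-math.dist(source, t) / temperature for t in targets]
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(value - shift) for value in logits]
    total = sum(exps)
    weights = [e / total for e in exps]
    point = tuple(
        sum(w * t[d] for w, t in zip(weights, targets))
        for d in range(len(source))
    )
    return point, weights


targets = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]
source = (0.1, 0.0)

hard = nearest_neighbor(source, targets)
soft, weights = soft_correspondence(source, targets, temperature=1.0)
sharp, _ = soft_correspondence(source, targets, temperature=0.01)
print(hard, soft, sharp)
```

Every target contributes to the soft point with a weight that varies smoothly with the source position, which is what makes the relaxed step usable with gradient-based optimization when written in an autodiff framework.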

Armin Ronacher (mitsuhiko)

Software developer and Open Source nut. Creator of the Flask framework. Engineering at @getsentry. Other things of interest: @pallets and @rust-lang

Mar 31, 2018

You can't Rust that

Some tips for how to be more productive in Rust by avoiding situations you cannot solve in Rust.

Oct 30, 2016

I don't understand Python's Asyncio

A little confession that I have no idea how asyncio works in Python 3.

The Primitives

asyncio is supposed to implement asynchronous IO with the help of coroutines. Originally implemented as a library around the yield and yield from expressions, it's now a much more complex beast as the language evolved at the same time. So here is the current set of things that you need to know exist:

event loops
event loop policies
awaitables
coroutine functions
old style coroutine functions
coroutines
coroutine wrappers
generators
futures
concurrent futures
tasks
handles
executors
transports
protocols
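
A few of these primitives can be seen together in a short sketch. It assumes a recent Python 3 and uses `asyncio.run`, which was added to the standard library after the post was written; the `fetch` coroutine and the delays are made up for illustration.

```python
import asyncio


async def fetch(name, delay):
    """A coroutine function; calling it returns a coroutine object (an awaitable)."""
    await asyncio.sleep(delay)  # hands control back to the event loop
    return f"{name} done"


async def main():
    loop = asyncio.get_running_loop()  # the event loop driving this coroutine

    # A task wraps a coroutine and schedules it on the running event loop.
    task = asyncio.create_task(fetch("task", 0.01))

    # A future is a lower-level awaitable that something else resolves later.
    future = loop.create_future()
    loop.call_later(0.01, future.set_result, "future done")  # returns a handle

    # An executor runs a blocking callable outside the event loop thread.
    blocking = loop.run_in_executor(None, sum, [1, 2, 3])

    # gather awaits all awaitables and returns their results in argument order.
    return await asyncio.gather(task, future, blocking)


results = asyncio.run(main())
print(results)
```

Transports, protocols, event loop policies, and old style coroutines are left out; the sketch only shows how coroutines, tasks, futures, handles, and executors meet on one event loop.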

In addition the language gained a few special methods that are new:

Nov 18, 2015

Python's Hidden Regular Expression Gems

Some hidden features of the Python re module and the support machinery that drives it.

There are many terrible modules in the Python standard library, but the Python re module is not one of them. While it's old and has not been updated in many years, it's one of the best of all dynamic languages I would argue.

Fixing up Groups

One annoying thing is that our group indexes are not local to our own regular expression but to the combined one. This means that if you have a rule that contains its own capturing group and you want to access that group by index, the index will be wrong. Fixing this would require a bit of extra engineering with a class that wraps the SRE match object with a custom one that adjusts the indexes and group names. If you are curious about that, I made a more complex version of the above solution that implements a proper match wrapper in a GitHub repository, together with some samples of what you can do with it.
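
The index shift can be reproduced with a small sketch. The two rules and their names are hypothetical, and combining them with named wrapper groups is just one simple way to build a combined pattern; it is not the scanner code from the post.

```python
import re

# Two hypothetical lexer rules; the second one has its own capturing group.
rules = [
    ("number", r"\d+"),
    ("assign", r"(\w+)\s*="),
]

# Combine the rules into one pattern, wrapping each rule in a named group.
combined = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in rules)
)

standalone = re.compile(rules[1][1]).match("x =")
merged = combined.match("x =")

# On its own, the rule's capture is group 1.
print(standalone.group(1))
# In the combined pattern, group 1 is the "number" wrapper, which did not match.
print(merged.group(1))
# The rule's own capture has moved to index 3 (after the two wrappers before it).
print(merged.lastgroup, merged.group(3))
```

A match wrapper of the kind described above would subtract the offset of the matching rule's wrapper group, so that callers can keep using rule-local indexes.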

Peter Goldsborough

A Promenade of PyTorch

For the past two years, I’ve been quite heavily invested in TensorFlow, either writing papers about it, giving talks on how to extend its backend or using it for my own deep learning research. As part of this journey, I’ve gotten quite a good sense of both TensorFlow’s strong points as well as weaknesses – or simply architectural decisions – that leave room for competition. That said, I have recently joined the PyTorch team at Facebook AI Research (FAIR), arguably TensorFlow’s biggest competitor to date, and currently much favored in the research community for reasons that will become apparent in subsequent paragraphs.

No Name

FYI: this idea of constructing a computation graph at runtime was done by Acar at CMU for self adjusting computations. You might be able to steal some ideas from them.

Non-Blocking Parallelism for Services in Go

Aayush Agrawal

I’m an experienced Data Scientist with specialized skills in machine learning-based solutions. I enjoy staying on top of cutting-edge data technologies, including big data platforms, deep learning, optimization methods, and business analytics. My current work involves building data-driven products to enable smarter recommendations for Microsoft Partners, M365 service administrators and end-users to ensure the best usage of M365 services. Before that, I have experience working in various verticals like agricultural technology, pharmaceuticals, retail, e-commerce, and ride-sharing business model.

Model calibration for classification tasks using Python

6 min

Model Calibration

Machine Learning

A hands-on introduction to model calibration using Python.

Oct 12, 2022

ML Interview Preparation Guide (Draft)

11 min

ML Interview Guide

A collection of resources while preparing for MLE interviews at Meta or other big tech companies.

Aug 24, 2024

ezyang's blog

Edward Z. Yang is a research engineer at Facebook who works on PyTorch, an open source deep learning library. In a previous life, he worked on Backpack, a new module system for Haskell.

You can find more outdated information about me at http://ezyang.com.

Tensor programming for databases, with first class dimensions(October 14, 2024)
What’s different this time? LLM edition(October 4, 2024)
Interactive scraping with Jupyter and Puppeteer(November 23, 2021)
PyTorch Developer Podcast(May 5, 2021)
Rage bug reporting(April 25, 2021)
The PyTorch open source process(January 6, 2021)
The hidden problem(?) with basic block procedures in SSA(October 24, 2020)
Idiomatic algebraic data types in Python with dataclasses and Union(October 14, 2020)
Let’s talk about the PyTorch dispatcher(September 10, 2020)
Dynamic scoping is an effect, implicit parameters are a coeffect(August 27, 2020)
A brief taxonomy of PyTorch operators by shape behavior(May 6, 2020)
vmap in Haskell(January 29, 2020)
PyTorch internals(May 16, 2019)
A short note about functional linear maps(May 15, 2019)
Microsoft Surface Book 2(March 17, 2019)
HIW’18: Let’s Go Mainstream with Eta!(September 23, 2018)
A year into Backpack(July 14, 2018)
A compile-time debugger that helps you write tensor shape checks(April 6, 2018)
Online/offline continuous integration(March 12, 2018)
Semantic Import Versioning in the wild(February 23, 2018)
Systems ML workshop panel(December 8, 2017)
Accelerating Persistent Neural Networks at Datacenter Scale (Daniel Lo)(December 8, 2017)
MOCHA: Federated Multi-Tasks Learning (Virginia Smith)(December 8, 2017)
A Machine Learning Approach to Database Indexes (Alex Beutel)(December 8, 2017)
Ray: A Distributed Execution Framework for Emerging AI Applications (Ion Stoica)(December 8, 2017)
Backpack for deep learning(August 17, 2017)
Proposal: Suggest explicit type application for Foldable length and friends(March 21, 2017)
Prio: Private, Robust, and Scalable Computation of Aggregate Statistics(March 17, 2017)
Designing the Backpack signature ecosystem(March 11, 2017)
How to integrate GHC API programs with Cabal(February 8, 2017)

Vincent_Qin's blog

🔭 I am currently working on SLAM.
🌱 I am currently learning SLAM and AI.
💬 Ask me about depth estimation/light field/SLAM etc.
👯 I am looking to collaborate on repo Recent-Stars-2020
📫 How to reach me: vincentqin#hotmail.com (#->@)
⚡ Fun fact: I 🧡🐈

Realcat Vincentqyw(https://github.com/Vincentqyw) starred a repository on 13/5/25
ngxson/smolvlm-realtime-webcam (HTML) 3.2k STARS

Realcat Vincentqyw(https://github.com/Vincentqyw) starred a repository on 7/5/25
huggingface/nanoVLM (Jupyter Notebook 79.9%, Python 20.1%) 961 STARS

Contributor Rankings

#1 Andrés Marafioti - 1 commits | GitHub Profile

andimarafioti Andrés Marafioti

Machine Learning Research Engineer at Hugging Face.

51 repositories, 236 followers

follows

andimarafioti Oriol Nieto

Senior Research Engineer at Adobe Research. Doctor in music data science (Doctoriol). Oaklander born in Barcelona. He/they.

52 repositories, 226 followers

Visit the full article: Tutorial - Deep XOR | Posts · 26/2/2017 · 1 minute

Realcat Vincentqyw(https://github.com/Vincentqyw) starred a repository on 3/5/25
skyzh/tiny-llm (Python, C++) 1.8k STARS

11-05

📝Note: Python zip()

2018-05-16

Light Field Depth Estimation

01-05-2022

📝Note: 5 seconds to train NeRF, NVIDIA Instant NeRF test

18-08-2019

📝Note: SLAM FAQ (IV): Solve ICP and use SVD decomposition to get the rotation matrix

Andrew Tulloch's new blog

Andrew Tulloch's old blog

3-10-2021

Improving PyTorch inference performance on GPUs with a few simple tricks

13-05-2014

The LASSO Estimator

27-10-2014

Cambridge Part III Mathematics Notes

I've cleaned up (somewhat) my notes from Cambridge Part III and have put them online - with LaTeX sources available on GitHub and PDFs linked below.

Advanced Financial Models

Advanced Probability

Lecture Notes

Applied Bayesian Statistics

Summary

Convex Optimization

Mathematics Of Operations Research

Non-Parametric Statistics

Percolation

Lecture Notes

Ramsey Theory

Lecture Notes

Statistical Theory

Time Series and Monte Carlo Analysis

Michael Clarke's blog

From t-tests to deep learning, I've covered a lot of ground in modeling, visualizing, and understanding data. I can provide inference for models on millions of observations, classify biomedical images to determine pathology, and scrape the web to explore political sentiment. What's more, I can help others understand the results and take appropriate action regarding them.

10-10-2022

Deep Linear Models

13-11-2021

Double Descent

Hugo Bowne-Anderson: data scientist - educator - writer - podcaster

other initiatives

I'm interested in exploring other ways to teach and discuss data science, machine learning, and AI. To this end, I piloted a series of Facebook Live coding sessions at DataCamp, which saw up to 40K unique viewers. Two of my favourites are Getting Started with the Tidyverse through the Titanic data set and Web Scraping & NLP in Python, in which I scrape novels from the web and plot word frequency distributions.

I enjoy writing tutorials. You can find a bunch I've written on DataCamp's community page by searching for my name. Here are a few to get started with:

Groupby, split-apply-combine and pandas
Hierarchical indices, groupby and pandas
Preprocessing in Data Science (Part 1)
Preprocessing in Data Science (Part 2)
Preprocessing in Data Science (Part 3)

I'm constantly thinking about how data science notebook technologies can be used to design productive educational environments. You can check out Eric Ma's and my interactive Jupyter notebooks for our Bayesian data science workshops here on Binder (more context in the GitHub repo here). I also built a DataCamp project that leverages the capabilities of Jupyter notebooks to create a novel educational experience: it's called "Word Frequency in Moby Dick" and in it, you'll get to scrape the novel Moby Dick from the website Project Gutenberg (which contains a large corpus of books), extract words from it, and dive into analyzing the distribution of words using the Natural Language Toolkit (nltk).

I've given a lot of webinars for business leaders, managers, and learning and development leaders across several verticals. Highlights include: What Managers Need To Know About Machine Learning, Inside the Data Science Workflow and Data Literacy in the 21st Century.

Selected Talks

Bayesian Data Science Two Ways: Simulation and Probabilistic Programming

SciPy 2018 Tutorial

This was a tutorial that I co-taught with Eric Ma to build participants' knowledge of Bayesian inference, workflows and decision making under uncertainty. We started with the basics of probability via simulation and analysis of real-world datasets, building up to an understanding of Bayes' theorem. We then introduced the use of probabilistic programming to do statistical modelling. Throughout this tutorial, we used a mixture of instructional time and hands-on time. During instructional time, we used a variety of datasets to anchor our instruction; during hands-on time, which immediately followed instructional time, our participants applied the concepts learned to the Darwin's finches dataset, which permeated the entire tutorial.

Tutorial material

Bayesian Data Science by Simulation

PyCon 2019 Tutorial

This tutorial was an Introduction to Bayesian data science through the lens of simulation or hacker statistics. Learners became familiar with many common probability distributions through i) matching them to real-world stories & ii) simulating them. They worked with joint/conditional probabilities, Bayes Theorem, prior/posterior distributions and likelihoods, while seeing their applications in real-world data analyses. They then saw the utility of Bayesian inference in parameter estimation and comparing groups and we wrapped up with a dive into the wonderful world of probabilistic programming using PyMC3.

Tutorial material

bayesian-stats-modelling-tutorial (Public)

How to do Bayesian statistical modelling using numpy and PyMC3

659 279

Jupyter Notebook

Sanjeev Sharma: I'm an Entrepreneur | Researcher

As the founder of Swaayatt Robots and Deep Eigen, I focus on developing cutting-edge algorithms for autonomous vehicles, enabling them to navigate highly complex and unpredictable environments.

Kanaka Rajan: I'm an Entrepreneur | Researcher

Both lectures are available on the COSYNE YouTube channel (see lecture title links) under a Creative Commons license. To request access to the lecture slides, please email: kanaka_rajan@hms.harvard.edu & kanaka-admin@stellatecomms.com

If you'd like to deepen your understanding of recurrent neural networks, I encourage you to complete a problem set created in collaboration with the COSYNE Tutorial TAs. The problem set has detailed instructions and questions to work through. Problems 1 and 2 are intermediate and should be done after watching Lecture 1; Problem 3 is advanced and should be done after watching Lecture 2. Solutions are available in Julia, MATLAB, and Python.

Solution Scripts

John Parkhill: ML, director of machine learning at Terray Therapeutics (https://x.com/Terray_Tx).

Followings of @memming on X

Jaivardhan Kapoor

@_Jaivardhan_

Followings of @_Jaivardhan_ on X

ML, director of machine learning at Terray Therapeutics (x.com/Terray_Tx). Father. NSF CAREER award giver-upper. Gibe and gambol enjoyer.

Nov 19, 2021
Coin Vol-II Hedging your BTC/ETH - The basics
Nov 13, 2021
Coin Volatility Surfaces
Sep 5, 2021
The Crypto-Carry Trade
Aug 5, 2021
Woo, Quantum Storytelling, Time Crystals and Misallocation
May 30, 2021
Pricing Options with TorchSDE
Apr 30, 2021
Solving multidimensional PDEs in pytorch
Apr 7, 2021
Using simple mean-reversion to remove carry from a VIX futures position
Mar 11, 2021
Moving on from the Macbook Pro
Mar 6, 2021
Simply Extracting information out of your Stochastic Series
Jan 6, 2021
Copulas made simple with Pytorch Distributions
Dec 5, 2020
How to Size Bets. The Kelly Criterion in PyTorch
Nov 5, 2020
The #1 Reason to have a 3D Printer.
Oct 5, 2020
Modest Proposals to Make Science Great Again.
Sep 21, 2020
Some Robot Art
Sep 5, 2020
Quaternion Averaging in Pytorch
Aug 5, 2020
Yet another static blog!

Sep 5, 2020

Quaternion Averaging in Pytorch: Detailed Page

At atomsandbits.ai we implement some seriously large formulas in TensorFlow. If we just went from LaTeX to tf. we wouldn't be able to do it. Here's a list of tricks and tools we use, applied to the problem of averaging rotations. Come for the tf. stay for the hypersphere.

The tensormol0.2 model chemistry reproduces a huge swath of chemistry (37 elements), which is in some sense a large fraction of our world. It's a big ole' formula for some geometry:

TensorMol

How does one use TensorFlow effectively to get something complicated done? It's not easy. I thought I'd write up an example a little simpler than modeling all of chemistry. How about averaging rotations/axis systems? Simple right? Well interesting story… The math is mostly due to Hamilton (~1843), however it wasn’t until the advent of computer graphics in 1985 that people even bothered to work out how to interpolate between rotations perfectly.

Rotation & Quaternions

The rotational algebra of our world is a beautiful, bedeviling thing. The reason is that although rotations act on a three-dimensional space, when embedded in three dimensions, rotations are not smooth or unique. When represented with Euler angles, every rotation has multiple representations. Change the order of rotations and you also change the endpoint (rotations are non-commutative). Traveling smoothly along some paths of rotations using a three-dimensional embedding, the third degree of freedom can suddenly become inaccessible (the phenomenon of “gimbal lock”). If you try to define an average or interpolated point of view in a naive way (axes => angles => interpolated angles), you will find gibberish zero axes and jerky, non-smooth behavior.
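Non-commutativity is easy to check numerically. The following is a minimal NumPy sketch (the helper names `rot_x` and `rot_z` are made up for this illustration and are not part of the post's code): the same two quarter turns, applied in opposite orders, send the x axis to different places.

```python
import numpy as np

def rot_x(a):
    """Rotation matrix for angle a (radians) about the x axis."""
    c, s = np.cos(a), np.sin(a)
    return np.array([[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]])

def rot_z(a):
    """Rotation matrix for angle a (radians) about the z axis."""
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

quarter = np.pi / 2
v = np.array([1.0, 0.0, 0.0])

# Matrices act right-to-left: the rightmost rotation is applied first.
x_then_z = rot_z(quarter) @ rot_x(quarter) @ v   # ends up on the y axis
z_then_x = rot_x(quarter) @ rot_z(quarter) @ v   # ends up on the z axis

print(np.round(x_then_z, 6), np.round(z_then_x, 6))
```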

To have smooth topology rotations must be embedded within a four-dimensional hypersphere, so we can forgive your brain. In this space a rotation is a 4-dimensional point, a quaternion, whose components can be thought of as the angle and 3 axis components of the rotation. Given a 3x3 rotation matrix Q, one can parameterize a quaternion (w,x,y,z)
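The formula itself did not survive in this copy of the post. Read back from the RotToQuat routine in the code further down, it would look like the following. This is a reconstruction, not the author's original typesetting, and it leaves out the small 1e-15 regularizers and absolute values that the code adds for numerical safety:

```latex
\begin{aligned}
w &= \tfrac{1}{2}\sqrt{1 + Q_{00} + Q_{11} + Q_{22}} \\
x &= \operatorname{sign}(Q_{21} - Q_{12})\,\tfrac{1}{2}\sqrt{1 + Q_{00} - Q_{11} - Q_{22}} \\
y &= \operatorname{sign}(Q_{02} - Q_{20})\,\tfrac{1}{2}\sqrt{1 - Q_{00} + Q_{11} - Q_{22}} \\
z &= \operatorname{sign}(Q_{10} - Q_{01})\,\tfrac{1}{2}\sqrt{1 - Q_{00} - Q_{11} + Q_{22}}
\end{aligned}
```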

Given any set of orthogonal axes (rows of Q), Euler's theorem guarantees an axis-angle rotation which can map the natural xyz axes back and forth into the new frame. The formula above yields the natural 4-d form of that rotation.

Now suppose you have two, three or four systems of axes (ax_1, ax_2, ax_3). For example you want to look at the sun then the moon, or you want to fit 4 pretty objects in your field of vision, or define invariant axes for a cloud of points (the reason we use this math in TensorMol). Can you simply average the quaternion components q_av = (ax_1+ax_2+ax_3)/3? Sadly no… You can immediately understand why if you imagine averaging rotations around opposite axis vectors as an owl might when spinning his head. The “good, smooth” quaternions keep to the surface of the 4-d hypersphere (a curvy subset of 4d-euclidean space). To interpolate lines on that sphere, you can use SLERP. To average multiple quaternions we must construct the 4x4 matrix which is the outer product of the list of quaternions (Nx4) with itself, weighted if desired:

$$Q = (q_{1}, q_{2}, \ldots), \qquad M = (w \cdot Q)^{t} Q$$

The eigenvector of this matrix with the largest eigenvalue is the desired average quaternion.
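As a rough illustration of the recipe, here is a NumPy sketch (not the author's TensorFlow implementation; `average_quaternions` is a name invented for this example). It also shows the owl problem: q and -q describe the same rotation, yet their component-wise mean is the zero vector, while the eigenvector average recovers q.

```python
import numpy as np

def average_quaternions(quats, weights=None):
    """Average unit quaternions (rows of an N x 4 array) via M = (w * Q)^T Q."""
    Q = np.asarray(quats, dtype=float)
    Q = Q / np.linalg.norm(Q, axis=-1, keepdims=True)
    w = np.ones(len(Q)) if weights is None else np.asarray(weights, dtype=float)
    M = (w[:, None] * Q).T @ Q               # 4 x 4 symmetric matrix
    eigvals, eigvecs = np.linalg.eigh(M)     # eigenvalues come back in ascending order
    q_av = eigvecs[:, -1]                    # eigenvector of the largest eigenvalue
    return q_av if q_av[0] >= 0 else -q_av   # fix the overall sign so that w >= 0

q = np.array([np.cos(np.pi / 8), 0.0, 0.0, np.sin(np.pi / 8)])  # 45 degrees about z
naive = (q + (-q)) / 2.0                     # component-wise mean of q and -q
robust = average_quaternions(np.stack([q, -q]))

print(naive, robust)
```

The sign fix at the end is a convention chosen for this sketch; an eigenvector is only defined up to sign, which is harmless here because q and -q encode the same rotation.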

Implementing complex math in tf.

Again, my goal is to get rotationally invariant axes for a set of points which are smooth, differentiable and local. I will walk through my whole implementation of this in tf. Step 1: don't use tf. Write a simple test of your formulas in a notebook-like math interface (Mathematica, IPython/Sage). Verify everything is working when you use all the fancy library routines tf. doesn't have (eigenvectors etc.). Here's what that looks like using Mathematica.

[Figure: Mathematica notebook with Manipulate sliders; image not preserved]

Those fancy Manipulate sliders are a nice way to get tangible faith that the point cloud is rotationally invariant when transformed using an averaged axis system depending on points in the cloud. It remains for us to do this same thing in tf. We're ready for step 2:


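import numpy as np
import tensorflow as tf  # TensorFlow 1.x graph API (tf.Session, tf.log, tf.is_nan)

Pi = np.pi
# TF_AxisAngleRotation, batch_size and MaxNAtom are used below but are not
# defined in this excerpt.

def slerp(v0, v1, t=0.05):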
    """
    Interpolate between quaternions v0 and v1.
    """
    v0 = safe_normalize(v0)
    v1 = safe_normalize(v1)
    dot = tf.reduce_sum(v0*v1,axis=-1,keepdims=True)
    # If the dot product is negative, slerp won't take
    # the shorter path. Note that v1 and -v1 are equivalent when
    # the negation is applied to all four components. Fix by
    # reversing one quaternion.
    signflip = tf.where(tf.less_equal(dot,0.),-1.*tf.ones_like(dot),tf.ones_like(dot))
    v1 *= signflip
    dot *= signflip
    # Linear answer.
    linq = safe_normalize(v0 + t*(v1-v0))
    #
    sdot = tf.clip_by_value(dot,-1.0,1.0)
    theta_0 = tf.acos(sdot)
    theta = theta_0*t
    sin_theta = tf.sin(theta)
    sin_theta_0 = tf.sin(theta_0)
    s0 = tf.cos(theta) - dot * sin_theta / (sin_theta_0+1e-19)
    s1 = sin_theta / (sin_theta_0+1e-19)
    sq = safe_normalize((s0 * v0) + (s1 * v1))
    #
    DOT_THRESHOLD = 0.9995
    tdot = tf.concat([dot,dot,dot,dot],axis=-1)
    slerpd = tf.where(tf.greater(tdot,DOT_THRESHOLD),linq,sq)
    ttiled = tf.concat([t,t,t,t],axis=-1)
    slerpdorv1 = tf.where(tf.greater(ttiled,1.0-1e-14),v1,slerpd)
    return tf.where(tf.less(ttiled,1e-14),v0,slerpdorv1)
def sftpluswparam(x):
    return tf.log(1.0 + tf.exp(100. * x)) / 100.0
def RotToQuat(axes_):
    """
    axes is a ... X 3 3 tensor of axes
    this generates a ... X 4 tensor of quaternions.
    which are 1:1 with those axes.
    """
    w = (1./2.)*tf.sqrt(1e-15+tf.abs(1 + axes_[...,0, 0] + axes_[...,1, 1] + axes_[...,2, 2]))
    x = tf.sign(axes_[...,2, 1] - axes_[...,1, 2])*tf.abs(0.5*tf.sqrt(1e-15+tf.abs(1.0 + axes_[...,0, 0] - axes_[...,1, 1] - axes_[...,2, 2])))
    y = tf.sign(axes_[...,0, 2] - axes_[...,2, 0])*tf.abs(0.5*tf.sqrt(1e-15+tf.abs(1.0 - axes_[...,0, 0] + axes_[...,1, 1] - axes_[...,2, 2])))
    z = tf.sign(axes_[...,1, 0] - axes_[...,0, 1])*tf.abs(0.5*tf.sqrt(1e-15+tf.abs(1.0 - axes_[...,0, 0] - axes_[...,1, 1] + axes_[...,2, 2])))
    return tf.stack([w,x,y,z],axis=-1)
def QuatToRot(q):
    """
    a_ ... X 4 tensor of quaternions
    this generates a ... X 3 X 3 of rotation matrices.
    """
    tmp=tf.stack([1 - 2.*(q[...,2]*q[...,2] + q[...,3]*q[...,3]), 2*(q[...,1]*q[...,2] - q[...,3]*q[...,0]),
    2*(q[...,1]*q[...,3] + q[...,2]*q[...,0]),2*(q[...,1]*q[...,2] + q[...,3]*q[...,0]), 1 - 2.*(q[...,1]*q[...,1] + q[...,3]*q[...,3]),
    2*(q[...,2]*q[...,3] - q[...,1]*q[...,0]),2*(q[...,1]*q[...,3] - q[...,2]*q[...,0]), 2*(q[...,2]*q[...,3] + q[...,1]*q[...,0]),
    1 - 2.*(q[...,1]*q[...,1] + q[...,2]*q[...,2])],axis=-1)
    return tf.reshape(tmp,[-1,3,3])
def VectorsToOrient(v1,v2):
    v1n = safe_normalize(v1)
    v2n = safe_normalize(v2)
    v3 = safe_normalize(tf.cross(v1n, v2n)+tf.constant(np.array([0., 0., 1e-19]), dtype=tf.float64))
    # Compute the average of v1, v2, and their projections onto the
    # plane.
    v_av = (v1n + v2n) / 2.0
    v_av = safe_normalize(v_av)
    # Rotate pi/4 cw and ccw to obtain v1,v2
    first = TF_AxisAngleRotation(v3, v_av, tf.constant(-Pi / 4., dtype=tf.float64))
    second = TF_AxisAngleRotation(v3, v_av,tf.constant(Pi / 4., dtype=tf.float64))
    vs = tf.concat([first[:, tf.newaxis, :], second[:, tf.newaxis, :],v3[:, tf.newaxis, :]],axis=1)
    return vs
def VectorsToAxisQs(v1,v2):
    return tf.reshape(RotToQuat(VectorsToOrient(v1,v2)),(-1, 4))
def safe_normalize(x_):
    nrm = tf.clip_by_value(tf.norm(x_,axis=-1,keepdims=True),1e-36,1e36)
    nrm_ok = tf.logical_and(tf.not_equal(nrm,0.),tf.logical_not(tf.is_nan(nrm)))
    safe_nrm = tf.where(nrm_ok,nrm,tf.ones_like(nrm))
    return x_*tf.where(nrm_ok,1.0/safe_nrm,tf.zeros_like(nrm))
def safe_inv_norm(x_):
    nrm = tf.clip_by_value(tf.norm(x_,axis=-1,keepdims=True),1e-36,1e36)
    nrm_ok = tf.logical_and(tf.not_equal(nrm,0.),tf.logical_not(tf.is_nan(nrm)))
    safe_nrm = tf.where(nrm_ok,nrm,tf.ones_like(nrm))
    return tf.where(nrm_ok,1.0/safe_nrm,tf.zeros_like(nrm))
def safe_norm(x_):
    nrm = tf.clip_by_value(tf.norm(x_, axis=-1, keepdims=True), 1e-36, 1e36)
    nrm_ok = tf.logical_and(
        tf.not_equal(nrm, 0.), tf.logical_not(tf.is_nan(nrm)))
    safe_nrm = tf.where(nrm_ok, nrm, tf.zeros_like(nrm))
    return safe_nrm
with tf.Graph().as_default():        
    xyzs = tf.Variable(np.random.random((batch_size,MaxNAtom,3))*7.0 - 5.0)
    init = tf.global_variables_initializer()
    sess = tf.Session(config=tf.ConfigProto(allow_soft_placement=True))
    sess.run(init)
    print(sess.run(xyzs[0,:2]))
    print(sess.run(VectorsToOrient(xyzs[:,0],xyzs[:,1])))
    print(sess.run(RotToQuat(VectorsToOrient(xyzs[:,0],xyzs[:,1]))))
    print(sess.run(QuatToRot(RotToQuat(VectorsToOrient(xyzs[:,0],xyzs[:,1])))))
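For a quick sanity check outside a TensorFlow session, the two conversion routines can be ported to NumPy and round-tripped. The sketch below is such a port under stated assumptions: the names `rot_to_quat` and `quat_to_rot` are invented here, and the 1e-15 regularizers are dropped because no gradients are taken.

```python
import numpy as np

def rot_to_quat(R):
    """NumPy port of RotToQuat: ... x 3 x 3 matrices -> ... x 4 quaternions (w, x, y, z)."""
    R = np.asarray(R, dtype=float)
    r00, r11, r22 = R[..., 0, 0], R[..., 1, 1], R[..., 2, 2]
    w = 0.5 * np.sqrt(np.abs(1.0 + r00 + r11 + r22))
    x = np.sign(R[..., 2, 1] - R[..., 1, 2]) * 0.5 * np.sqrt(np.abs(1.0 + r00 - r11 - r22))
    y = np.sign(R[..., 0, 2] - R[..., 2, 0]) * 0.5 * np.sqrt(np.abs(1.0 - r00 + r11 - r22))
    z = np.sign(R[..., 1, 0] - R[..., 0, 1]) * 0.5 * np.sqrt(np.abs(1.0 - r00 - r11 + r22))
    return np.stack([w, x, y, z], axis=-1)

def quat_to_rot(q):
    """NumPy port of QuatToRot: ... x 4 quaternions -> ... x 3 x 3 rotation matrices."""
    q = np.asarray(q, dtype=float)
    w, x, y, z = q[..., 0], q[..., 1], q[..., 2], q[..., 3]
    entries = [1 - 2 * (y * y + z * z), 2 * (x * y - z * w), 2 * (x * z + y * w),
               2 * (x * y + z * w), 1 - 2 * (x * x + z * z), 2 * (y * z - x * w),
               2 * (x * z - y * w), 2 * (y * z + x * w), 1 - 2 * (x * x + y * y)]
    return np.stack(entries, axis=-1).reshape(q.shape[:-1] + (3, 3))

# A quarter turn about the z axis.
R = np.array([[0.0, -1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 0.0, 1.0]])
q = rot_to_quat(R)
R_back = quat_to_rot(q)

print(q)
print(R_back)
```

One caveat of the sign-based formula: for an exact half turn the antisymmetric part of the matrix is zero, so sign(0) = 0 wipes out the axis components and the round trip fails. Such inputs would need separate handling.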

Style notes about the tf. code given above:

Each routine can be easily compared with the mathematica output, to quickly debug.
In general it is best to choose an order of dimensions for your tensor which goes (least often contracted,…,most often contracted), because several helpful tf. functions assume the first dimension of a tensor is the batch dimension.
tf.sqrt and 1/(tensor) are both unstable operations in tf. They are unstable in a tricky way, because the implied chain-rule derivative graph (coming from tf.gradients(… op…, var)) will still often evaluate to NaNs even when it appears that the arguments to the routine would always be in the well-behaved domain. One must make liberal use of tf.clip_by_value(), tf.where(), and infinitesimals to ensure both the routine and the routine's derivatives are well-behaved. safe_norm is a good example.
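The mechanism behind the last note can be seen without TensorFlow. The sketch below writes the chain rule out by hand in NumPy. It is a simplified stand-in for what reverse-mode differentiation does, not TensorFlow's actual gradient code, and both function names are hypothetical. Masking the output of sqrt is not enough, because the backward pass still multiplies the mask (0) by the local derivative (inf). The input has to be made safe before the sqrt.

```python
import numpy as np

def sqrt_grad_naive(x):
    """Gradient of where(x > 0, sqrt(x), 0) as a chain rule would compose it."""
    mask = (x > 0).astype(x.dtype)           # the where() passes gradient only where x > 0
    with np.errstate(divide="ignore", invalid="ignore"):
        local = 0.5 / np.sqrt(x)             # d sqrt(x)/dx, infinite at x == 0
        return mask * local                  # 0 * inf evaluates to NaN at x == 0

def sqrt_grad_safe(x):
    """Same gradient, but the bad inputs are replaced before the sqrt is taken."""
    mask = (x > 0).astype(x.dtype)
    safe_x = np.where(x > 0, x, 1.0)         # any harmless positive value works here
    local = 0.5 / np.sqrt(safe_x)            # finite everywhere
    return mask * local

x = np.array([0.0, 4.0])
naive = sqrt_grad_naive(x)
safe = sqrt_grad_safe(x)

print(naive, safe)
```

This is the same pattern the safe_normalize and safe_inv_norm routines follow: one tf.where builds a harmless denominator, and a second one selects the final value.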

Epilogue

So is all this serious rotational mathematics only good for defining axis systems for atomic positions? No! Facebook AI Research and collaborators from the EPFL published a nice use of quaternions for skeletal motion planning last week.

Christian S. Perone / Machine Learning (@polymtl/@UMontreal)

Greg Brockman President & Co-Founder @OpenAI

Greg Brockman's Blog

Jul 30, 2019

How I became a machine learning practitioner

For the first three years of OpenAI, I dreamed of becoming a machine learning expert but made little progress towards that goal. Over the past nine months, I've finally made the transition to being a machine learning practitioner. It was hard but not impossible, and I think most people who are good programmers and know (or are willing to learn) the math can do it too. There are many online courses to self-study the technical side, and what turned out to be my biggest blocker was a mental barrier — getting ok with being a beginner again.

Studying machine learning during the 2018 holiday season.

Early days

A founding principle of OpenAI is that we value research and engineering equally — our goal is to build working systems that solve previously impossible tasks, so we need both. (In fact, our team is comprised of 25% people primarily using software skills, 25% primarily using machine learning skills, and 50% doing a hybrid of the two.) So from day one of OpenAI, my software skills were always in demand, and I kept procrastinating on picking up the machine learning skills I wanted.

After helping build OpenAI Gym, I was called to work on Universe. And as Universe was winding down, we decided to start working on Dota — and we needed someone to turn the game into a reinforcement learning environment before any machine learning could begin.

Dota

Time out

After we lost two games in The International in 2018, most observers thought we'd topped out what our approach could do. But we knew from our metrics that we were right on the edge of success and mostly needed more training. This meant the demands on my time had relented, and in November 2018, I felt I had an opening to take a gamble with three months of my time.

I learn best when I have something specific in mind to build. I decided to try building a chatbot. I started self-studying the curriculum we developed for our Fellows program, selecting only the NLP-relevant modules. For example, I wrote and trained an LSTM language model and then a Transformer-based one. I also read up on topics like information theory and read many papers, poring over each line until I fully absorbed it.

It was slow going, but this time I expected it. I didn't experience flow state. I was reminded of how I'd felt when I just started programming, and I kept thinking of how many years it had taken to achieve a feeling of mastery. I honestly wasn't confident that I would ever become good at machine learning. But I kept pushing because… well, honestly because I didn't want to be constrained to only understanding one part of my projects. I wanted to see the whole picture clearly.

One important conceptual step was overcoming a barrier I'd been too timid to do with Dota: make substantive changes to someone else's machine learning code. I fine-tuned GPT-1 on chat datasets I'd found, and made a small change to add my own naive sampling code. But it became so painfully slow as I tried to generate longer messages that my frustration overwhelmed my fear, and I implemented GPU caching — a change which touched the entire model.

I had to try a few times, throwing out my changes as they exceeded the complexity I could hold in my head. By the time I got it working a few days later, I realized I'd learned something that I would have previously thought impossible: I now understood how the whole model was put together, down to small stylistic details like how the codebase elegantly handles TensorFlow variable scopes.

Grant Slatton

Formerly built the world's fastest filesystem at AWS, now the fastest spreadsheet at http://rowzero.com

Grant Slatton's Blog

Binary IQ — A model of LLM capability

Lightweight property-based testing at Row Zero — How we verify correctness

Rust Macros: Zero to Hero — A comprehensive guide on Rust macros

Algorithms we develop software by — Pathfinding applied to the software solution domain

Building Filesystems — High level ideas in filesystem design

Quasirandom sequences — Cool method to generate non-clumping random points

How to write complex software — A general method

Bureaulogy — The study of bureaucracy

A peasant's plight — On the shackling of the peasantry

Every Man his own API — A sociotechnological trend

Culture is a set of social Schelling points — Solving coordination problems in community-building

Portals are Undertheorized — The importance of arrival

Designing bug-proof engines — A spectrum of engineering philosophies

Accidental Urbanism — How I got into the scene

How to Bootstrap a Town — A modest plan

Sports vs Games — An aesthetic distinction

Nobody Cares — A rant about caring

Status among whom? — An essay about status relativism

Ghost Side Control Escape System (BJJ) — A video instructional on my preferred side control escape system

AI follows auditability — An essay about the order AI will move through the economy

Book List — Stuff I've read

Onsen Unreality — Our experience at an onsen 'theme park' in Tokyo

Tesla Full Self-Driving — My experience with FSD

Internet Fiction — Collection of amateur stories — mainly sci-fi — that I like

All the way down — Very short story about simulation

Story Ideas — A collection of premises for stories

Things I wish I knew earlier — Collection of stuff I would tell my younger self if I could

Road Width Extremism — In favor of narrow roads

Links to See Also — Other "small web" personal sites I recommend

HTML5 Canvas simulations — A collection of little HTML5 canvas demos

Twitter — Essay about how getting on Twitter unexpectedly added a lot of value to my life

Shuttle — A useful concurrency checker library we used to verify our filesystem at AWS

Book Review: 'The Perfectionists: How Precision Engineers Created the Modern World' — Excellent book about the history of precision machining

Markdown-ish — Writing a Markdown(ish) parser with the nom library

Grant holding Sampson at sunset overlooking Puget Sound

Alex Nichol

AI researcher, hobby web developer, math geek. Constantly learning.

Alex Nichol Blog - Pickled ML

Posts

Jonathan Frankle

Chief AI Scientist at Databricks. Founding team at MosaicML. MIT/Princeton alum. Lottery ticket enthusiast. Working on data intelligence.

Rishit Dagli (Rishit-dagli)

CS + Math @UofT | AI Research, Qualcomm | Research ML, Vision UofT, Vector | RT @kubernetes 1.26-9

106 repositories, 891 followers

@Alexandre_Mutel

Director C#/.NET Tech Group at Unity, OSS, lang/compilers, GPU/sound, architecture 🏎️ Microsoft MVP, ex-demoscene PC/Amiga 🎆 Veggie 🌿, opinions are my own.

Followings of @nietras1 on X

Alexandre Mutel ▶ https://mastodon.social/@xoofx

@xoofx

xoofx Alexandre Mutel

88 repositories, 1.4k followers

2023

10x Performance with SIMD Vectorized Code in C#/.NET | Jul 9 | C#, .NET, x86, assembler | Use your CPU at its full width!

2020

Stark - Native Compiler - Prototype 2019 | Mar 21 | Stark, Melody, OS, Compiler, LLVM, C#, .NET | Development of an AOT native compiler using RyuJIT
Stark - Language And Frontend Compiler - Prototype 2019 | Mar 6 | Stark, Melody, OS, Compiler, LLVM, C#, .NET | Syntax of the language and the development of the front-end compiler
The Odyssey of Stark and Melody | Mar 5 | Stark, Melody, OS, Compiler, LLVM, C#, .NET | Prototyping a new language and OS with the help of the .NET ecosystem and seL4 micro-kernel

2018

Generate automatically async/await code from sync code with Roslyn | Dec 26 | C#, .NET, Roslyn
Writing a Managed JIT in C# with CoreCLR | Apr 12 | C#, .NET, CoreCLR
Porting the Unity Engine to .NET CoreCLR | Apr 6 | C#, .NET, CoreCLR, Unity
Productivity with ReSharper | Mar 9 | Visual Studio, Visual Studio 2015, Roslyn

2009

Random float number generator using x86 ASM code optimized in size | Oct 25 | assembler, x86

Alexandre Mutel (https://github.com/xoofx) starred a repository on 20/6/25
JimmyLefevre/kb (C) 275 STARS

Alexandre Mutel (https://github.com/xoofx) starred a repository on 10/5/25
metalama/Metalama (C#) 292 STARS

A meta-programming framework for code generation, aspect-oriented programming, and architecture verification of large C# codebases.

Followings of Gael Fraiteur on GitHub

Yan Cui

Hi there! I’m Yan Cui(theburningmonk).

Visit the full article: How to perform database migration for a live service with no downtime | Posts · AWS, DynamoDB, Serverless · 12/2023

Visit the full article: Building a custom IAM system has made me appreciate AWS IAM even more | Posts · AppSync, AWS, Lambda, Serverless · 12/2023

Alexandre Mutel (https://github.com/xoofx) followed a GitHub user on 2/5/25
meziantou

Alexandre Mutel (https://github.com/xoofx) starred a repository on 7/4/25
Alan-Rock-GS/GpuScript (C#) 171 STARS

Alexandre Mutel (https://github.com/xoofx) starred a repository on 5/4/25
https://github.com/nietras/Llm.cs (C#) 49 STARS

Overview of commits/PRs from Feb 1, 2025 to Feb 28, 2025

Backend URL Link https://github.com/xoofx?tab=overview&from=2025-02-01&to=2025-02-28

Remove `List<IObserver<T>>.ToArray()` allocations in `LightweightObservableBase` #18316

xoofx

Here are the details of a specific PR from the AvaloniaUI/Avalonia repository:

This PR removes the List<IObserver<T>>.ToArray() allocations that happen in LightweightObservableBase when routed events are fired (e.g. whenever you move the mouse).

When profiling the memory, I noticed that when generating lots of routing events (e.g. just moving the mouse over a window) several MB of IObserver<ValueTuple<Object, RoutedEventArgs>>[] were created.

Bishal Santra

Research Engineer @microsoft Research India | Working on Language Modeling for Retrieval | IIT KGP

103 repositories, 36 followers

Aug 12, 2023
Explaining Issues with Channel Method in LLM Prompt-based Classification
Feb 11, 2023
Prove that the set of algebraic numbers are countable (using primes)
Feb 10, 2023
Is ∑_i=1ⁿ 1/n divergent?
Oct 7, 2021
How to Connect through an SSH Jump Server (For CNeRG project students)
Oct 26, 2019
Transformer Language Models and Pretraining
Aug 31, 2019
English to Hindi Transliteration using Seq2Seq Model
Aug 31, 2019
Deep Sentiment Analysis
Jul 13, 2019
Text Classification using Naive Bayes Method
Jul 8, 2019
Training a Language Model with a Xtra-Small Transformer (Transformer-XS)
Jul 7, 2019
Simple Reddit Dialogue Preprocessor
Jul 3, 2019
How I created this site using Jekyll?
Jan 12, 2017
Generating Gamma Random Variable in CUDA in Parallel
Aug 6, 2016
Minimum Mean Squared Error (MMSE) Estimator
Mar 14, 2016
DecycledJSON - Circular reference breakers for JSON

Charlie Marsh

Building Astral: high-performance tools for Python, starting with Ruff.

In the past: Staff software engineer at Spring Discovery, senior engineer at Khan Academy, and Computer Science major at Princeton.

These days, I write on Notion.

Check out some of my public projects:

Hi, I'm Charlie Marsh.

I'm building high-performance developer tools for Python, starting with Ruff, an extremely fast Python linter written in Rust.

I was most recently a staff software engineer at Spring Discovery. Before that, I was a senior software engineer at Khan Academy.

This is a collection of notes and blog posts I’ve written on Notion:

🐍 Using Mypy in production at Spring

🌐 What’s WebAssembly?

🛠️ Python tooling could be much, much faster

🤖 Building large language model-powered applications

☁️ Isolates, microVMs, and WebAssembly

⚡ Ruff: The First 200 Releases

You can find me on Twitter.

For older posts and projects, check out my personal site.

Building a Really, Really Small Android App
Writing a Reproducible Test Plan
Reviewing Code from Both Sides
Getting up and Running with Robolectric
Learning Android in a Production Setting
Exploring Flow, Facebook's JavaScript Type Checker (JavaScript Weekly)
Bitcoin Script: An In-Browser Playground (Hacker News)
Speeding up SVGs with CSS Transforms at Khan Academy (Hacker News)
Rendering React Components on the Server
Styling React Components: How to Escape Selector Hell (talk delivered @ Khan Academy)
An Overly Thorough Guide to Python Class Attributes (Python Weekly)
A Primer on Computational Geometry in Python (Python Weekly)
Why Are There So Many Pythons? (Hacker News, Python Weekly, Pycoder's Weekly)
Compiling to JavaScript: A Case-by-Case Guide to the *Scripts
PhantomJS: Common Gotchas for Beginners
The Idiot-Proof Guide to Setting up Your Personal AWS Instance
An Introduction to Rule Optimization in Software Defined Networking
Prefix vs. Ternary Rules in SDN
Setting up an OCaml Environment on Mac OS X
Programming Languages and the Solutions they Suggest
First Thoughts on OCaml
A Primer on Streams and Lazy Computation
Binomial Heaps: Merge Better
Performance Improvements in iOS
NP != NOT P

kevin frans website v5

Hey, I'm Kevin. I am a PhD student at BAIR advised by Pieter Abbeel and Sergey Levine. I did my B.S. and M.Eng at MIT with Phillip Isola. I am interested in deep reinforcement learning, unsupervised learning, and AI-based creative tools. I also lead engineering at ParagraphAI. I have spent time at Cross Labs, Sizigi, Autodesk Research, and OpenAI. In my free time, I like to design and build video games.

19 Dec 2023

Small-Research: Tanh Activations with DDPG

19 Dec 2023

Small-Research: Policy Extraction in IQL

8 Sep 2023

Successor Representations Explained

1 Aug 2023

Deriving the KL divergence loss in variational autoencoders

15 May 2022

For AGI, we need better tasks. For better tasks, we need open-endedness. (ALOE 2022 Notes)

6 Dec 2021

A Mathematical Definition of Interestingness

12 Nov 2021

To extract information from language models, optimize for causal response

2 Nov 2021

Data digesters, ML^2, Interestingness

28 Jun 2021

CLIPDraw: Exploring Text-to-Drawing Synthesis

3 Apr 2021

StampCA: Growing Emoji with Conditional Neural Cellular Automata

2 Dec 2020

Quality Diversity: Evolving Ocean Creatures

12 Nov 2020

Open-Endedness 3: Multicell World

22 Oct 2020

Open-Endedness 2: Bitwise Chemicals

15 Oct 2020

Open-Endedness 1: Cellworld

27 Nov 2019

Omakase 5: Bullet Dance

21 Jul 2019

Omakase 1: Ropeman

8 Apr 2019

Linking C++ and Python with Boost (Anaconda)

12 Jul 2018

RAIN Project: Evolution of the Game Development Dream

26 Oct 2017

[Link] Learning a Hierarchy

28 Feb 2017

Deepcolor: Automatic Coloring and Shading of Manga-Style Lineart

29 Dec 2016

What is the Natural Gradient, and How Does it Work?

8 Dec 2016

Speeding Up TRPO Through Parallelization and Parameter Adaptation

5 Dec 2016

Colorizing the DRAW Model

1 Jul 2016

Simple Reinforcement Learning Methods to Learn CartPole

28 Jun 2016

Generative Adversarial Networks Explained

20 Jun 2016

Making Use of the Model

15 Jun 2016

The Policy Gradient

15 Jun 2016

Visualizing Features from a Convolutional Neural Network

7 Jun 2016

Model-Free Prediction and Control

5 Jun 2016

Planning: Policy Evaluation, Policy Iteration, Value Iteration

5 Jun 2016

Markov Processes in Reinforcement Learning

5 Jun 2016

Reinforcement Learning Basics

6 Apr 2016

Neural Style Explained

Stathis Kamperis

I am a radiation oncologist and physicist. I like to build bridges between different scientific disciplines (medicine, physics, informatics).

15 repositories, 29 followers

Sewon Min Incoming faculty @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp.

@sewon__min

11/2024: I won't be attending EMNLP or NeurIPS this year, but my co-authors will be presenting our work! Check out our papers onBenchmarking the Reproduction of Copyrighted Text^{(EMNLP Main, NeurIPS Regulatable ML Workshop Contributed Talk)},Scaling a Datastore in Retrieval-Based LMs^{(NeurIPS Main)}, andAn Open Mixture-of-Experts LM^{(NeurIPS Workshop on Efficient Natural Language and Speech Processing (ENLSP), Oral Talk)}.

11/2024: I am recruiting PhD students at UC Berkeley's EECS! If you're interested, please apply directly through the UC Berkeley admissions portal (details here). Kindly note that I cannot discuss applications outside the official admissions process.

12/2023: I am attending EMNLP and NeurIPS! At EMNLP, I will give an invited talk on Rethinking the Role of Demonstrations at the Big Picture Workshop on Dec 7th, and give an oral talk on FActScore on Dec 8th. At NeurIPS, I will give a spotlight talk on SILO at the Distribution Shifts Workshop on Dec 15th, and give an oral talk on SILO at the Regulatable ML Workshop on Dec 16th.

08/2023: Together with Suchin Gururangan, we present SILO, proposing to segregate the training data and the inference-time data in nonparametric LMs to mitigate legal risk in LMs.

07/2023: Our paper that examines the role of demonstrations in CoT prompting, led by Boshi Wang, won an Honorable Mention at ACL 2023.

07/2023: I co-taught a tutorial on retrieval-based LMs at ACL 2023. Slides & recordings are available on the website.

12/2022: Check out our new preprint, Nonparametric Masked Language Modeling. Code and model checkpoints are available here.

09/2022: I was selected by the EECS Rising Stars Program.

08/2022: Together with Sang Michael Xie, we wrote a post on How does in-context learning work? A framework for understanding the differences from traditional supervised learning at Stanford AI Blog.

05/2022: I co-taught the ACL tutorial on Few-Shot NLP with Pretrained Language Models (slides, recordings).

02/2022: Check out our new preprint, Rethinking the Role of Demonstrations: What makes In-context Learning Work? All experiments are reproducible from this code. (Update 10/2022: The paper was accepted to EMNLP 2022.)

02/2022: I am co-organizing two workshops at ACL 2022: Repl4NLP (CFP) and Spa-NLP (CFP).

10/2021: Our new preprint, MetaICL: Learning to Learn In Context is out (w/ code). Check out the demo! (Update 04/2022: The paper was accepted to NAACL 2022.)

08/2021: Our new preprint, Noisy Channel Language Model Prompting for Few-Shot Text Classification is out w/ code! (Update 02/2022: The paper was accepted to ACL 2022.)

07/2021: Our new preprint, FaVIQ: FAct Verification from Information-seeking Questions is out! Visit FaVIQ website to download data and see samples. (Update 02/2022: The paper was accepted to ACL 2022.)

07/2021: I am co-organizing The 2nd Workshop on Unstructured/Structured KBs, hosted at AKBC 2021.

06/2021: I co-taught the NAACL-HLT tutorial on Beyond Paragraphs: NLP for Long Sequences.

04/2021: Our new preprint, Joint Passage Ranking for Diverse Multi-Answer Retrieval is out! This is done as part of my internship at Google. (Update 08/2021: The paper was accepted to EMNLP.)

01/2021: We, the NeurIPS 2020 EfficientQA organizers, together with participants, wrote NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned. The video of the NeurIPS event is also available here. (Update 05/2021: The paper was accepted to PMLR.)

12/2020: I am co-organizing The 3rd Workshop on Machine Reading for Question Answering, hosted at EMNLP 2021. Stay tuned for Call for papers!

09/2020: I made an Open-domain QA Demo using DPR. Give it a try!

06/2020: I am co-organizing Competition on Efficient Open-Domain Question Answering, hosted at NeurIPS 2020. [leaderboard]

06/2020: I am co-organizing Workshop on Unstructured/Structured KBs, hosted at AKBC 2020.

04/2020: Our new preprint, AmbigQA: Answering Ambiguous Open-domain Questions is out! Visit AmbigQA website to download data and see samples.

04/2020: Our new preprint, Dense Passage Retrieval for Open-domain Question Answering is out (w/ code)!

Kristoffer Carlsson

Software engineer, Julia Computing

SIMD and SIMD-intrinsics in Julia

Tue, Nov 13, 2018 · simd, intrinsics, julia

Short guide on SIMD and how to call (SIMD) intrinsics in the Julia programming language.

Case study: Improving performance of a code written in Matlab style

Mon, Dec 26, 2016 · performance, vectorization, loops, julia

Analysis and optimization of a small code snippet posted on the Julia discourse mailing list.

Here are some of the open source projects I have created or been involved with:

Pkg.jl – Julia’s package manager.
NearestNeighbors.jl – High performance nearest neighbor data structures and algorithms.
Tensors.jl – Efficient computations with symmetric and non-symmetric tensors with support for automatic differentiation.
OhMyREPL.jl – Syntax highlighting and other enhancements for the Julia REPL.
Crayons.jl – Colored and styled strings for terminals.
PGFPlotsX.jl – Seamlessly create plots in Julia using the PGFPlots LaTeX package.
Pardiso.jl – Calling the PARDISO sparse solver library.
Tokenize.jl – Tokenization for Julia source code.
TimerOutputs.jl – Formatted output of timed sections.
BlockArrays.jl – Interface for blocked arrays.
Distances.jl – A Julia package for evaluating distances (metrics) between vectors.
NLsolve.jl – Julia solvers for systems of nonlinear equations and mixed complementarity problems.
MMA.jl – The “Method of Moving Asymptotes”-algorithm. (old package)

Danielle Navarro

Hi there! I’m Danielle Navarro. I’m a data scientist, generative artist, and a recovering academic living in Sydney with my two kids and a Netflix subscription. Once upon a time I was a mathematical psychologist. After that I was a developer advocate and occasional software engineer. I’ve sometimes been accused of being a statistician.

djnavarro Danielle Navarro

Data scientist. Former academic. Occasional generative artist

233 repositories · 1.1k followers

Contact details, social media, etc

Email: djnavarro@protonmail.com
GitHub: github.com/djnavarro
Mastodon: hachyderm.io/@djnavarro
LinkedIn: linkedin.com/in/djnavarro
Orcid: orcid.org/0000-0001-7648-6578
Scholar: scholar.google.com/citations?user=QPH_lRIAAAAJ

Notes from a data witch

A blog by Danielle Navarro


Yihui Xie

Hi there! I’m Yihui Xie. I’m a Freelancer (open source programmer, contractor, blogger, and writer)

I’m currently a freelancer, and was a software engineer at Posit Software, PBC (2013-2023). I earned my PhD from the Department of Statistics, Iowa State University. My thesis was Dynamic Graphics and Reporting for Statistics, advised by Di Cook and Heike Hofmann. I have developed a series of R packages either seriously or for fun (or both), such as litedown, knitr, animation, bookdown, blogdown, pagedown, xaringan, and tinytex. I founded a Chinese website called “Capital of Statistics” in 2006, which has grown into a large online community on statistics. I initiated the China R conference in 2008. I’m a big fan of GitHub, LyX and Pandoc. I used to hate IE but no longer care since it has almost died. I fall asleep when I see beamer slides, and I yell at people who use \textbf to write \title. I know I cannot eat code, so I cook almost every day to stay away from my computer for two hours.

Author: Yihui Xie
I was introduced to this author by this Mastodon post: hachyderm.io/@djnavarro/113477662963181887
by the author hachyderm.io/@djnavarro

yihui Yihui Xie

Hi there! I’m Yihui Xie. I’m a Freelancer (open source programmer, contractor, blogger, and writer)

89 repositories · 9.6k followers

2018

2018-07-30 Solving Statistical Computing Problems with SQL

Thomas Lumley on Mastodon and Bluesky

I got this tweet: Thomas Lumley on X: "Blog post, software, and preprint for my #JSM2018 talk/poster"
from this Author: Yihui Xie
from his blog post: Solving Statistical Computing Problems with SQL - Yihui Xie | 谢益辉

Faster generalised linear models in largeish data

2018/03/05

Thomas Lumley

@tslumley

Prof Richard Xu

Hi there! I’m Prof Richard Xu.
I am a Professor at the Department of Mathematics, Hong Kong Baptist University (HKBU) 香港浸会大学数学系教授

roboticcam Prof Richard Xu 徐亦达教授

I am a Professor at the Department of Mathematics, Hong Kong Baptist University (HKBU) 香港浸会大学数学系教授

13 repositories · 5.1k followers

machine-learning-notes Public

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习，概率模型和深度学习的讲义(2000+页)和视频链接

8.5k 1.7k

Jupyter Notebook

Course on Foundational Mathematics in Machine Learning 机器学习基础数学课程

Course on Intermediate Mathematics in Machine Learning 机器学习中级数学课程

Sinovation DeeCamp 创新工场DeeCAMP讲义

Deep Learning Research Topics 深度学习研究

Optimization Method 优化方法

Deep Learning Basics 深度学习基础

Restricted Boltzmann Machine

3D Geometry Computer vision 3D几何计算机视觉

Reinforcement Learning 强化学习

Natural Language Processing 自然语言处理

Data Science PowerPoint and Source Code 数据科学 PowerPoint 和源代码

Probabilistic Model 概率模型课件

Monte-Carlo Inference 蒙特卡洛推理

Advanced Probabilistic Model 高级概率模型课件

Alexander Fischer

Hi there! I’m Alexander Fischer.
Data Scientist @trivago

juanitorduz Juan Orduz

Mathematician & Data Scientist

34 repositories · 577 followers

follows

s3alfisc Alexander Fischer

Data Scientist @trivago

43 repositories · 81 followers

Ross Wightman

Hi there! I’m Ross Wightman.
Computer Vision @huggingface. Always learning, constantly curious. Building ML/AI systems, watching loss curves.

rwightman Ross Wightman

Computer Vision @huggingface. Always learning, constantly curious. Building ML/AI systems, watching loss curves.

74 repositories · 6.7k followers

Miles Cranmer

Hi there! I’m Miles Cranmer.

milescranmer Miles Cranmer

253 repositories · 1.6k followers

Overview of commits/PRs from Mar 1, 2025 to Mar 31, 2025

Backend URL Link https://github.com/MilesCranmer?tab=overview&from=2025-03-01&to=2025-03-31

fix: comparison operator parsing#845

MilesCranmer

Here are the details of a specific commit from the PySR repository:

Pull Request Test Coverage Report given in #845

Changed Files:

File Name or File Path

Overview of commits/PRs from Mar 1, 2025 to Mar 31, 2025

Backend URL Link https://github.com/MilesCranmer?tab=overview&from=2025-03-01&to=2025-03-31

Create benchmark suite#1084

MilesCranmer

Here are the details of a specific commit from the JuliaNLSolvers/Optim.jl repository:

Pull Request Test Coverage Report given in #1084

This creates a simple benchmark for catching performance regressions on small, tightly controlled problems. To kick things off I added the multivariate first-order optimizers including Adam, AdaMax, BFGS, LBFGS, NGMRES, ConjugateGradient, GradientDescent, and MomentumGradientDescent.

I also add a GitHub action to run AirspeedVelocity.jl on this benchmark for any new PR. It will automatically print out the performance and load time comparison of master in a GitHub comment on the PR.

Patrick Kofod Mogensen

hey, it works :) https://github.com/JuliaNLSolvers/Optim.jl/actions/runs/14055921576/job/39355263452?pr=1138

DavisVaughan

Hi there! I’m DavisVaughan.

hamelsmu Hamel Husain

373 repositories · 2.1k followers

https://github.com/hamelsmu?tab=following

cderv Christophe Dervieux

321 repositories · 578 followers

https://bsky.app/profile/cderv.bsky.social

https://bsky.app/profile/davisvaughan.bsky.social

Reference: Air, an extremely fast R formatter by Davis Vaughan

DavisVaughan DavisVaughan

452 repositories · 826 followers

Amazon RDS + R

May 10, 2017

Jon Shlens

Hi there! I’m Jon Shlens.

rwightman Ross Wightman

Computer Vision @huggingface. Always learning, constantly curious. Building ML/AI systems, watching loss curves.

74 repositories · 6.7k followers

rwightman JonShlens

1 repository · 86 followers

Reference: PR - Odd batch_size specific behaviour with nasnet_large on ImageNet validation #2778

Tutorials

These tutorials provide a general introduction to topics I find quite interesting but often lack good explanations in textbooks or the online literature.

Tutorial on Independent Component Analysis

A complete introduction and discussion of independent component analysis. Builds on previous tutorial on principal component analysis.

Version 1.0

Tutorial on Principal Component Analysis

A full introduction, description, derivation, and discussion of principal component analysis. Concrete examples for intuition building, the mathematical relation to SVD, and new extensions of this algorithm.

Version 3.02

A Light Discussion and Derivation of Entropy

A light discussion of the underlying assumptions behind entropy followed by a rigorous but simple derivation of the formula for entropy.

Version 1.01

Notes on Kullback-Leibler Divergence and Likelihood

An intuitive discussion about where Kullback-Leibler divergence arises and its relationship to likelihood theory.

Version 1.01

Notes on Generalized Linear Models of Neurons

An introduction to the application of GLMs to model neurons and networks of neurons. Brief discussion and derivation of primary equations pertaining to maximum likelihood estimation.

Version 1.51

Yixuan Qiu

Hi there! I’m Yixuan Qiu.
Currently an associate professor in the School of Statistics and Management at Shanghai University of Finance and Economics (SUFE).

djnavarro Danielle Navarro

Data scientist. Former academic. Occasional generative artist

233 repositories · 1.1k followers

yihui Yihui Xie

Hi there! I’m Yihui Xie. I’m a Freelancer (open source programmer, contractor, blogger, and writer)

89 repositories · 9.6k followers

Reference: Mastodon post - hachyderm.io/@djnavarro/113477662963181887

yihui Yihui Xie

Hi there! I’m Yihui Xie. I’m a Freelancer (open source programmer, contractor, blogger, and writer)

89 repositories · 9.6k followers

yixuan Yixuan Qiu

96 repositories · 841 followers

Reference: Yihui Xie's post - Solving Statistical Computing Problems with SQL - Yihui Xie | 谢益辉

Steven G. Johnson

Hi there! I’m Steven G. Johnson.
Professor of Applied Mathematics and Physics, Massachusetts Institute of Technology

stevengj Steven G. Johnson

Professor of Applied Mathematics and Physics

152 repositories · 1.3k followers

Overview of commits/PRs from Feb 1, 2025 to Feb 28, 2025

Backend URL Link https://github.com/stevengj?tab=overview&from=2025-01-01&to=2025-01-31

document/export chebvandermonde#24

Steven G. Johnson

Here are the details of the commit for this JuliaMath/FastChebInterp.jl repository:

https://discourse.julialang.org/t/multivariate-polynomial-regression-of-discrete-data-in-l-infinity-norm/125369/7?u=stevengj

https://gitlab.com/nsajko/FindMinimaxPolynomial.jl

https://xn--2-umb.com/22/approximation/

Overview of commits/PRs from July 1, 2023 to July 31, 2023

Backend URL Link https://github.com/stevengj?tab=overview&from=2023-07-01&to=2023-07-31

Random.randcycle(1) should throw?#50479

stevengj

I found another paper on Sattolo's algorithm that defines cyclic permutations in a different way, which allows the identity permutation only for n=1:
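As background (this is not the definition from the paper mentioned above, and it is Python rather than Julia's `Random.randcycle`), here is a minimal sketch of Sattolo's algorithm. The function names `sattolo_cycle` and `cycle_length_from_zero` are illustrative. The only difference from a Fisher-Yates shuffle is that the swap index `j` is drawn strictly below `i`, which forces the result to be a single cycle of length n. For n=1 no swap can happen, so the output is the identity permutation `[0]`.

```python
import random


def sattolo_cycle(n, rng=random):
    """Return a random permutation of range(n) that forms one n-cycle."""
    perm = list(range(n))
    i = n
    while i > 1:
        i -= 1
        # Draw j from 0..i-1 (never i itself); this is what distinguishes
        # Sattolo's algorithm from an ordinary Fisher-Yates shuffle.
        j = rng.randrange(i)
        perm[i], perm[j] = perm[j], perm[i]
    return perm


def cycle_length_from_zero(perm):
    """Length of the cycle containing index 0 (requires len(perm) >= 1)."""
    steps, k = 1, perm[0]
    while k != 0:
        k = perm[k]
        steps += 1
    return steps
```

A permutation of n elements is a single cycle exactly when the cycle through index 0 has length n, so `cycle_length_from_zero(sattolo_cycle(n)) == n` should hold for every n >= 1.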

`AutoModel` class for `image-text-to-text` models#32042

merveenoyan

Thomas Stringer

Hi there! I’m Thomas Stringer.

Call for maintainers! #148.

Who should be a maintainer? Somebody with GitHub Actions experience, or the desire to obtain that experience. Also a maintainer should be a modern code craftsperson that is passionate about shipping production-quality software. This GitHub Action can be part of important build and deployment pipelines. Not to mention, it is likely running inside many existing users' environments in their runners. It is important that changes are well-tested, and are the right thing for our users.

Manual Approval in a GitHub Actions Workflow
Posted: March 28, 2022
Updated: March 28, 2022
Visit the full article here

Visit the full article: Manual Approval in a GitHub Actions Workflow | Posts · 28/3/2022 · 4 minutes

Open an issue on the trstringer/manual-approval repository

Martin Evans

Hi there! I’m Martin Evans.

http://martindevans.me

LLamaSharp

LLamaSharp is a C# wrapper around llama.cpp. This is not my project alone, but I became one of the lead maintainers last year and I've continued working on it this year.

In 2024 my major contribution to LLamaSharp was the development of the BatchedExecutor which is an entirely new low-level abstraction around language models. The BatchedExecutor is designed to expose all of the power of llama.cpp in a safe way, for example multiple parallel sequences evaluated all together in one batch is as simple as:

Sequences can be easily saved and loaded, forked into 2 sequences with the same prefix (which internally share the same space in memory), the KV cache can be accessed and manipulated (e.g. to implement context shifting), sequences can even be prompted with embeddings directly which allows things like LLava.

My long term goal for 2025 is to rewrite many of the higher level parts of LLamaSharp to operate on top of the BatchedExecutor, this will reduce the overall complexity of the project by implementing it all in one place and should offer more power to advanced users since they can always build on top of BatchedExecutor instead of using the low level llama.cpp primitives.

Martin Evans

Hi there! I’m Martin Evans.

Martin Evans (https://github.com/martindevans) starred a repository on 5/4/25
https://github.com/MerlinVR/UdonSharp (C#) 706 STARS

Contributor Rankings

#1 MerlinVR - 1035 commits | GitHub Profile
#2 Momo the Monster - 54 commits | GitHub Profile

momo-the-monster Momo the Monster

80 repositories · 50 followers

Followings of Momo the Monster(momo-the-monster) on GitHub

Followings of Jeremy Cowles (jcowles) on GitHub

pixeljetstream Christoph Kubisch

NVIDIA

16 repositories · 89 followers

Visit the full article: Life of a triangle - NVIDIA's logical pipeline | Posts · 6/02/2016

Cédric Luthi

Hi there! I’m Cédric Luthi.

@0xced@hachyderm.io

Philipp Wagner (https://github.com/bytefish) followed Cédric Luthi (https://github.com/0xced) on 25/1/25.

Visit the @0xced/114309797988146204 post page on Hachyderm, which references the relevant issue on GitHub, ServiceBusAdministrationClient support #17. The posted date is 10/4/25.

In the discussion on GitHub (https://github.com/Azure/azure-service-bus-emulator-installer/issues/17#issuecomment-2790842139), a user described difficulties installing and running the Azure Service Bus Emulator. They reported errors that persisted despite following the provided installation instructions. The community responded with suggestions to verify system requirements and permissions, encouraging further dialogue to troubleshoot and resolve these issues collaboratively.

GitHub - 0xced/Chisel: Remove Unwanted Dependencies from Your .NET Projects

Remove Unwanted Dependencies from Your .NET Projects - 0xced/Chisel

Visit the 0xced/Chisel page on Hachyderm

Mark Heath

Hi there! I’m Mark Heath.

GitHub - markheath/azure-functions-links: Useful Links for Azure Functions

Useful Links for Azure Functions - markheath/azure-functions-links

Visit the Azure Functions Links GitHub Repository

azure-functions-links Public

Useful links for Azure Functions.

214 36

Danilo Poccia

Hi there! I’m Danilo Poccia.

GitHub - danilop/AWS_Lambda_in_Action: This source code distribution is a companion to the AWS Lambda in Action book available from Manning Publications.

https://www.manning.com/books/aws-lambda-in-action

Visit the AWS_Lambda_in_Action GitHub Repository

AWS_Lambda_in_Action Public

This source code distribution is a companion to the AWS Lambda in Action book available from Manning Publications.

287 122

JavaScript

New – A Shared File System for Your Lambda Functions

https://aws.amazon.com/blogs/aws/new-a-shared-file-system-for-your-lambda-functions/

by Danilo Poccia on 16 JUN 2020 in Amazon Elastic File System (EFS), Announcements, AWS Lambda, Compute, Launch, News, Serverless, Storage

Visit the New – A Shared File System for Your Lambda Functions Blog Post

Fei Peng

Hi there! I’m Fei Peng.

Hardware Intrinsic in .NET Core 3.0 - Introduction

https://fiigii.com/2019/03/03/Hardware-intrinsic-in-NET-Core-3-0-Introduction/

by Fei Peng on 2019-03-03 in .NET Core, SIMD, x86

Visit the Hardware Intrinsic in .NET Core 3.0 - Introduction Documentation

API Proposal: Add Intel Hardware Intrinsic Functions and Namespace #23057

https://github.com/dotnet/runtime/issues/23057

Repository: github.com/dotnet/runtime

Visit the GitHub Issue for API Proposal: Add Intel Hardware Intrinsic Functions and Namespace

PacketTracer Public

The SIMD-accelerated ray tracing in C# powered by Intel hardware intrinsic of .NET Core.

113 9

Timur Iskhakov

Hi there! I’m Timur Iskhakov.

On April 20, 2025, a Reddit user shared their excitement about completing their first significant AI project in C#, which utilized ONNX (Open Neural Network Exchange). They expressed how impressed they were by the capabilities of the ONNX framework, highlighting its ability to streamline model training and deployment across various platforms. The post detailed their journey through the project, including the challenges they faced and the solutions they discovered. The author encouraged others in the community to explore ONNX for their own AI endeavors, noting its versatility and the positive impact it had on their workflow. The enthusiasm radiating from their experience resonated with fellow enthusiasts, sparking discussions and sharing of similar projects. Link to the post - Posted on 20/4/25.

https://www.reddit.com/r/csharp/comments/1k37gj7/my_first_big_ai_project_in_c_onnx_blown_away_by/

My biggest tip is to do as much as possible on the GPU—I use ILGPU to do this, but you could also use something like compute shaders in Silk.NET, OpenTK, or ComputeSharp. — nullandkale, posted on 21/4/25

I searched in Microsoft Bing Browser with the query "ilgpu c#" and found these helpful results: Computing the Convex Hull on GPU and Vectorized Computations and SIMD.

ComputingTheConvexHullOnGpu Public

Computing the Convex Hull on GPU

5 0

Visit the full article: Exploring Spans and Pipelines | Improving the performance of file parsing by using new goodies in .NET Core | Posts · 31/10/2019 · 5 minutes · c# cuda algorithms

Kristoffer Carlsson

Software engineer, Julia Computing

SIMD and SIMD-intrinsics in Julia

Visit the Blog Post titled SIMD and SIMD-intrinsics in Julia

Posted on Tue, Nov 13, 2018 simd,intrinsics,julia

Philipp Wagner

Hi there! I’m Philipp Wagner.

GitHub - bytefish/facerec: Implements face recognition algorithms for MATLAB/GNU Octave and Python.

Implements face recognition algorithms for MATLAB/GNU Octave and Python

Advanced Examples: Building your own PredictableModel

Basically all face recognition algorithms are the combination of a feature extraction and a classifier. The Eigenfaces method, for example, is a Principal Component Analysis with a Nearest Neighbor classifier. Local Binary Patterns Histograms is another such method. The feature (which must be an AbstractFeature) and the classifier (which must be an AbstractClassifier) form a PredictableModel, which does the feature extraction and learns the classifier.
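The feature-plus-classifier composition described above can be sketched in plain Python. This is an illustration under stated assumptions, not facerec's actual code: the class names `AbstractFeature`, `AbstractClassifier` and `PredictableModel` come from the text, while `MeanCentered`, `NearestNeighbor` and all method signatures here are hypothetical stand-ins chosen to keep the example self-contained. A real pipeline would use something like PCA as the feature.

```python
import math


class AbstractFeature:
    def compute(self, X, y):
        raise NotImplementedError

    def extract(self, x):
        raise NotImplementedError


class AbstractClassifier:
    def compute(self, X, y):
        raise NotImplementedError

    def predict(self, x):
        raise NotImplementedError


class MeanCentered(AbstractFeature):
    """Toy feature (hypothetical): subtract the training mean from each vector."""

    def compute(self, X, y):
        self.mean = [sum(col) / len(X) for col in zip(*X)]
        return [self.extract(x) for x in X]

    def extract(self, x):
        return [v - m for v, m in zip(x, self.mean)]


class NearestNeighbor(AbstractClassifier):
    """1-nearest-neighbor classifier with Euclidean distance."""

    def compute(self, X, y):
        self.X, self.y = list(X), list(y)

    def predict(self, q):
        dists = [math.dist(q, x) for x in self.X]
        return self.y[dists.index(min(dists))]


class PredictableModel:
    """Runs the feature extraction, then learns and queries the classifier."""

    def __init__(self, feature, classifier):
        if not isinstance(feature, AbstractFeature):
            raise TypeError("feature must be an AbstractFeature")
        if not isinstance(classifier, AbstractClassifier):
            raise TypeError("classifier must be an AbstractClassifier")
        self.feature, self.classifier = feature, classifier

    def compute(self, X, y):
        features = self.feature.compute(X, y)
        self.classifier.compute(features, y)

    def predict(self, x):
        return self.classifier.predict(self.feature.extract(x))


model = PredictableModel(MeanCentered(), NearestNeighbor())
model.compute([[0, 0], [10, 10]], ["a", "b"])
```

After `compute`, calling `model.predict([1, 1])` extracts the feature for the query and hands it to the classifier, so swapping in a different feature or classifier does not change the calling code.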

facerec Public archive

Implements face recognition algorithms for MATLAB/GNU Octave and Python.

941 472

Python

Elasticsearch

Elasticsearch. The heart of the Elastic Stack

Elasticsearch is an open source distributed, RESTful search and analytics engine, scalable data store, and vector database capable of addressing a growing number of use cases. As the heart of the Elastic Stack, it centrally stores your data for lightning-fast search, fine‑tuned relevancy, and powerful analytics that scale with ease.

Itamar Syn-Hershko

Hi there! I’m Itamar Syn-Hershko.
CTO & Founder of BigData Boutique. Search, BigData and Cloud expert - making the cloud a better place day by day.

Andrew Lock

Hi there! I’m Andrew Lock.

Ivan Cesar

Hi there! I’m Ivan Cesar.

Andrew Lock

Hi there! I’m Andrew Lock.

Writing Logs to Elasticsearch with Fluentd using Serilog in ASP.NET Core

https://andrewlock.net/writing-logs-to-elasticsearch-with-fluentd-using-serilog-in-asp-net-core/

Category: ASP.NET Core, DevOps, Logging, Docker

Published on: June 20, 2018

Estimated Reading Time: ~7 min read

Visit the full article: Writing Logs to Elasticsearch with Fluentd using Serilog in ASP.NET Core

serilog-aspnetcore Public

Serilog integration for ASP.NET Core

1.3k 207

aspnetcore, serilog, aspnet-core

Anthony Sneed

View the Blog Page

Visit the full article: Announcing Event Driven .NET – An Event Driven Microservices Platform for .NET | Posted on March 21, 2022 by Tony Sneed

Why I added this Repository/Article/Blog/PR?

https://blog.tonysneed.com/2020/06/25/event-stream-processing-micro-framework-apache-kafka/

Ivan Cesar

Hi there! I’m Ivan Cesar.

An Elasticsearch Tutorial for .NET Developers

https://www.toptal.com/dot-net/elasticsearch-dot-net-developers

Category: Elasticsearch, .NET, Tutorial

Published on: 8 September 2017

Author: Ivan Cesar

Visit the full article: An Elasticsearch Tutorial for .NET Developers

elastic-net-example Public

This is an example of how Elasticsearch can be integrated easily with a .NET application. Feel free to fork/comment if you like.

49 34

Itamar Syn-Hershko

Hi there! I’m Itamar Syn-Hershko.
CTO & Founder of BigData Boutique. Search, BigData and Cloud expert - making the cloud a better place day by day.

Securing Elasticsearch Clusters

A number of articles have been written over the past few days documenting the various methods of securing Elasticsearch, the most notable of which is this piece by Itamar Syn-Hershko. For all our readers using Elasticsearch, especially those who are using it in production, who are not necessarily aware of the various pitfalls that need to be taken into consideration, we’ve summed up some of the methods that we recommend employing.

Marc Clifton

Hi there! I’m Marc Clifton.

http://marcclifton.wordpress.com/

Articles by Marc Clifton

Azure Function: Compute Pi Stress Test

KerasHub: Multi-framework Pretrained Models

Pretrained model hub for Keras 3.

Anshuman Mishra

GDE; Google Summer of Code'23 @tensorflow; Contributor @keras-team;

Visit the authoring page here: Authoring Page

Medium

Anshuman Mishra (@mishradotexe)

I wrote Qwen 2.5 from scratch. Works with JAX, PyTorch and Tensorflow. This marks my return to open source after an year.

View the Tweet
PR mentioned in the tweet: Add Qwen 2.5 by shivance · Pull Request #2088 · keras-team/keras-hub · GitHub

Commit: checkpoint conversion wip

New File Added: tools/checkpoint_conversion/convert_qwen_checkpoints.py

View Commit

Now, see another PR by the same Keras Team: SIMILAR TASK

Overview of commits/PRs from Sep 1, 2024 to Sep 30, 2024

Backend URL Link https://github.com/divyashreepathihalli?tab=overview&from=2024-09-01&to=2024-09-30

add weights and conversion script for mobilenet#1875

divyashreepathihalli

Here are the details of a specific PR from the keras-team/keras-hub repository:

Dask

Parallel computing with task scheduling

Dask

Dask is a Python library for parallel and distributed computing.


Yoshifumi Kawai

ZLinq Public

Zero allocation LINQ with Span and LINQ to SIMD, LINQ to Tree (FileSystem, Json, GameObject, etc.) for all .NET platforms and Unity.

1.4k 108

c-sharp, linq, unity

Mohammad Elsheimy

Sam Greydanus

Windscape AI and Greenfield Properties. Previously @google Brain, @dartmouth College

https://greydanus.github.io/about_me/
@samgreydanus

Sam Greydanus (https://github.com/greydanus) starred 2 repositories on 02/4/25
https://github.com/zwimpee/cursivetransformer (Jupyter Notebook) 3 Stars
https://github.com/zwimpee2/cursivetransformer (Jupyter Notebook) 1 Star

Both repositories, https://github.com/zwimpee/cursivetransformer and https://github.com/zwimpee2/cursivetransformer, focus on training a transformer model to generate cursive, with progress updates noted in their respective README files (February 12, 2025, and August 13, 2025).

MartinDotNet

Hi there! I’m MartinDotNet.

https://martinjt.me

timdeschryver (https://github.com/timdeschryver) starred 2 repositories on 25/3/25
practical-otel/opentelemetry-aspire-collector (C#) 28 STARS
microsoft/playwright-mcp

opentelemetry-aspire-collector Public

29 0

Visit the article "Building a Secure OpenTelemetry Collector" published on 20 December, 2023 here: Building a Secure OpenTelemetry Collector

ocb-config-builder Public

Building a secure OpenTelemetry Collector December 20, 2023

13 1

ABP

Hi there! I’m ABP.
ABP offers an opinionated architecture to build enterprise software solutions with best practices on top of the .NET and the ASP.NET Core platforms.

abp Public

Open-source web application framework for ASP.NET Core! Offers an opinionated architecture to build enterprise software solutions with best practices on top of the .NET. Provides the fundamental ...

13.3k 3.5k

aspnetcore, serilog, aspnet-core

Alexandre Mutel (https://github.com/xoofx) starred a repository on 25/3/25
abpframework/abp (C#) 13.3k STARS

ABP offers an opinionated architecture to build enterprise software solutions with best practices on top of the .NET and the ASP.NET Core platforms. It provides the fundamental infrastructure, production-ready startup templates, pre-built application modules, UI themes, tooling, guides and documentation to implement that architecture properly and automate the details and repetitive works as much as possible.

OpenTelemetry - CNCF

Hi there! I’m OpenTelemetry - CNCF.

OpenTelemetry makes robust, portable telemetry a built-in feature of cloud-native software.

OpenTelemetry Collector Public

OpenTelemetry Collector

opentelemetry.io

5k 1.6k

MartinDotNet

Hi there! I’m MartinDotNet.

Safia Abdalla

Hi there! I’m Safia Abdalla.

Overview of commits/PRs from Oct 1, 2024 to Oct 31, 2024

Backend URL Link https://github.com/0xced?tab=overview&from=2024-09-01&to=2024-09-30

Implement the Mvc PushFileStreamResult API#58161

Cédric_Luthi

CaptainSafia Review

Reviewed on: December 17, 2024

Check out the GitHub Profile of CaptainSafia on GitHub.

Here are the details of a specific commit from the dotnet/aspnetcore repository:

Roger Koenker

Hi there! I’m Roger Koenker.

How to Run Regression on Large Datasets in R

October 2, 2011 | Programming, R, Statistics

Visit the original article on Statr.me (https://statr.me/2011/10/large-regression/).

Xiao Nan yixuanq 12 years ago: Yup. There's more. Prof. Roger Koenker once combined MySQL with his qr: Link. There's barely few experiments on the cluster & classification's hpc topic. I think the algorithms are just naturally inefficient or too complicated to reimplement.

Statistical Analysis of Large Datasets - An Exploration of R - MySQL Interface: Visit the link by Roger Koenker, University of Illinois, and Álvaro A. Novo, University of Illinois. Topics include Least Squares and Quantile Regression.

Visit the R Vinaigrettes Page - contains

Conformal Quantile Regression pdf

Single instruction, multiple data (SIMD)


Fei Peng

Hi there! I’m Fei Peng.

Timur Iskhakov

Hi there! I’m Timur Iskhakov.

Kristoffer Carlsson

Software engineer, Julia Computing

Visit the Blog Post titled SIMD and SIMD-intrinsics in Julia

Gérald Barré (@meziantou)

Hi there! I’m Gérald Barré (@meziantou).

Visit the full article: Replace characters in a string using Vectorization | Posts · 11/7/2022 · 4 minutes

Friedrich von Never (@ForNeVeR)

Visit the full article: Code Vectorization in .NET and Other Technologies | Posts · 32/10/2023 · 5 minutes

SciML Open Source Scientific Machine Learning

Open source software for scientific machine learning

Overview of commits/PRs from Jan 1, 2025 to Jan 31, 2025

Backend URL Link https://github.com/ChrisRackauckas?tab=overview&from=2025-01-01&to=2025-01-31

Test Master#1159

ChrisRackauckas

Here are the details of a specific commit from the SciML/SciMLSensitivity.jl repository:

Avik Pal

DataLoader from MLUtils https://lux.csail.mit.edu/stable/tutorials/intermediate/1_NeuralODE#Loading-MNIST

Yunjey Choi

https://github.com/yunjey/pytorch-tutorial

Vincent D. Warmerdam

Visit the blog post "The factorio benchmark", posted on 2025/03/10

This page has all the details of the work, which include:

A Python library that can interact with the game, which is the main entrypoint for the agents that compete in tasks.
A leaderboard with the results of the agents that have competed so far (Claude seems the winner, but the fact that one of the authors is from Anthropic might help there).

https://github.com/JackHopkins/factorio-learning-environment

Why I added this Repository/Article/Blog/PR?

Linear Regression in Machine learning

Benoît Legat

Practical 1 – Linear regressions | Benoît Legat | Written by: Jean Bouchat

Recommended for you:
jump-dev/MathOptInterface.jl (https://github.com/jump-dev/MathOptInterface.jl) is a data structure for mathematical optimization problems in Julia.
MathOptInterface.jl (Julia) 434 Stars
Contributors:View Contributors

Paul Berg

Overview of commits/PRs from Feb 1, 2025 to Feb 28, 2025

Backend URL Link https://github.com/avik-pal?tab=overview&from=2024-08-01&to=2024-08-31

feat: more coverage for common NN operations#95

avik-pal

Here are the details of a specific PR from the EnzymeAD/Reactant.jl repository:

Reviewers

wsmoses

Pangoraw

Visit the full article: Fast online estimates on the GPU | Posts · 06/08/2021 · 4 minutes

Benoît Legat

The Applications of Mathematical Optimisation, Mixed-integer Linear Programming

Course given at the Cambridge Centre for International Research

https://blegat.github.io/teaching/

https://blegat.github.io/ccir/practical1/

The Python Pickle Module

Matthew Rocklin

Hi there! I’m Matthew Rocklin.

Guillaume Lemaitre

Hi there! I’m Guillaume Lemaitre.

Matthew Rocklin

Hi there! I’m Matthew Rocklin.

Visit the full article: Pickle isn't slow, it's a protocol | Posted on 2018/07/23 by Matthew Rocklin

Guillaume Lemaitre

Hi there! I’m Guillaume Lemaitre.

Overview of commits/PRs from Feb 1, 2025 to Feb 28, 2025

Backend URL Link https://github.com/glemaitre?tab=overview&from=2024-12-01&to=2024-12-31

Using pickling as much as possible for serialization#966

glemaitre

Here are the details of a specific PR from the probabl-ai/skore repository:

Andriy Burkov

Hi there! I’m Andriy Burkov.

Andriy Burkov (https://github.com/aburkov) starred a repository on 03/4/25
https://github.com/erikbern/ann-benchmarks (Python) 5.2k Stars

Benchmarks of approximate nearest neighbor libraries in Python

ann-benchmarks.com

Why I added this Repository/Article/Blog/PR?

pgvector Module (Python)

"""
This module supports connecting to a PostgreSQL instance and performing vector
indexing and search using the pgvector extension. The default behavior uses
the "ann" value of PostgreSQL user name, password, and database name, as well
as the default host and port values of the psycopg driver.
"""

Dockerfile Configuration

RUN service postgresql start && \
    psql -c "CREATE USER ann WITH ENCRYPTED PASSWORD 'ann'" && \
    psql -c "CREATE DATABASE ann" && \
    psql -c "GRANT ALL PRIVILEGES ON DATABASE ann TO ann" && \
    psql -d ann -c "GRANT ALL ON SCHEMA public TO ann" && \
    psql -d ann -c "CREATE EXTENSION vector" && \
    psql -c "ALTER USER ann SET maintenance_work_mem = '4GB'" && \
    psql -c "ALTER USER ann SET max_parallel_maintenance_workers = 0" && \
    psql -c "ALTER SYSTEM SET shared_buffers = '4GB'"

USER root

.NET 8 container workshop

Martin Krasser

Martin Krasser(https://github.com/krasserm) starred a repository on 26/3/25
wjayesh/mahilo (Python) 360 STARS

Overview of commits/PRs from Apr 1, 2025 to Apr 30, 2025

Backend URL Link https://github.com/krasserm?tab=overview&from=2021-04-01&to=2021-04-30

fsdl-text-recognizer-2021-labs (Public)
forked from https://github.com/the-full-stack/fsdl-text-recognizer-2021-labs

krasserm

Here are the details of the fork of this fsdl-text-recognizer-2021-labs repository:

Why I added this Repository/Article/Blog/PR?

Simon Willison

Hi there! I’m Simon Willison.

Simon Willison(https://github.com/simonw) contributed to a repository on 14/5/25
taketwo/llm-ollama (Python) 292 STARS

Followings of Sergey Alexandrov(taketwo) on GitHub

Followings of Konrad Rudolph(klmr) on GitHub

Followings of Paolo Di Tommaso(pditommaso) on GitHub

Stephen Turner

Hi there! I’m Stephen Turner.

Visit the full article: DuckDB vs dplyr vs base R | Posts · 7/10/2024

Simon Willison(https://github.com/simonw) starred a repository on 3/5/25
skyzh/tiny-llm (Python, C++) 1.8k STARS

Simon Willison(https://github.com/simonw) starred a repository on 25/4/25
antirez/hnstyle (Python) 40 STARS

This repository contains the code used in the following blog post and YouTube videos:

Now, we are ready to insert the word into a Redis vector set, using the command: VADD key FP32 [blob with 350 floats] username. The details of vector sets are not covered here, but you can find them in the Redis documentation.

Visit the full article: Reproducing Hacker News writing style fingerprinting | Date: 16/4/25
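The VADD command quoted above takes the vector as a binary blob of 32-bit floats. A minimal sketch of building that blob and the command arguments in Python is shown below. It assumes a little-endian float layout, and the key name `hn_fingerprint` and both helper functions are hypothetical. No Redis connection is made here.

```python
import struct


def fp32_blob(values):
    """Pack floats into a blob of 32-bit floats (little-endian is assumed)."""
    return struct.pack(f"<{len(values)}f", *values)


def vadd_args(key, vector, element):
    """Build the argument tuple for: VADD key FP32 <blob> element."""
    return ("VADD", key, "FP32", fp32_blob(vector), element)


vector = [0.0] * 350  # placeholder values; a real vector holds the features
args = vadd_args("hn_fingerprint", vector, "username")
print(len(args[3]))  # 350 floats * 4 bytes each = 1400
```

A Redis client that can send raw commands could then receive these arguments unchanged, for example via redis-py's `execute_command(*args)`.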

Simon Willison(https://github.com/simonw) starred a repository on 14/4/25
invisal/sqlite-internal (JavaScript, TypeScript) 260 STARS

https://github.com/querymx/querym

Why I added this Repository/Article/Blog/PR?

CUDA Programming Model

Timur Iskhakov

Rotation related problems

Rotation-related problems. Quaternions. Rodrigues’ rotation formula.

Andrzej Więckowski, Ph.D.

John Parkhill ML, director of machine learning Terray Therapeutics (https://x.com/Terray_Tx).

Alex Chi Z.

Simon Willison(https://github.com/simonw) starred a repository on 3/5/25
skyzh/tiny-llm (Python, C++) 1.8k STARS

Visit the full article: Delta Join in the Streaming Engine based on a shared state index | Posts · 29/15/2022 · 10 minutes

Andrzej Więckowski, Ph.D.

Martin Evans(https://github.com/martindevans) starred 3 repositories on 2/5/25
BurstMathUtils (C#) 28 STARS
BurstCollections (C#) 78 STARS
PBD2D (C#) 98 STARS

Visit the full article: Rodrigues’ formula | Posts · 16/02/2025 · 4 minutes

John Parkhill ML, director of machine learning Terray Therapeutics (https://x.com/Terray_Tx).

Sep 5, 2020

Quaternion Averaging in Pytorch

Timur Iskhakov

Visit the full article: Computing the Convex Hull on GPU | Posts · 05/10/2020 · 10 minutes · c# cuda algorithms

Ahmet Alp Balkan

Ahmet Alp Balkan(https://github.com/ahmetb) followed Anish Athalye (https://github.com/anishathalye) on 5/4/25

Ahmet Alp Balkan(https://github.com/ahmetb) starred a repository on 5/4/25
anishathalye/porcupine (Go) 1k STARS

Anish Athalye

Ahmet Alp Balkan(https://github.com/ahmetb) followed Anish Athalye (https://github.com/anishathalye) on 5/4/25

Ahmet Alp Balkan(https://github.com/ahmetb) starred a repository on 5/4/25
anishathalye/porcupine (Go) 1k STARS

Mat Leonard

Followings of Kritika Prakash on GitHub

Visit the full article: Tutorial - How to use Sampyl with a simple linear model | Posts · 21/9/2018 · 10 minutes

Visit the full GitHub Documentation: Examples - Models built with Sampyl - German Tank Problem

Visit the full GitHub Documentation: Example - German Tank Problem, a classic problem in statistics

Oriol Nieto

Realcat Vincentqyw(https://github.com/Vincentqyw) starred a repository on 7/5/25
huggingface/nanoVLM (Jupyter Notebook 79.9%, Python 20.1%) 961 STARS

Contributor Rankings

#1 Andrés Marafioti - 1 commits | GitHub Profile

andimarafioti Andrés Marafioti

Machine Learning Research Engineer at Hugging Face.

51 repositories · 236 followers

follows

andimarafioti Oriol Nieto

Senior Research Engineer at Adobe Research. Doctor in music data science (Doctoriol). Oaklander born in Barcelona. He/they.

52 repositories · 226 followers

Visit the full article: Tutorial - Deep XOR | Posts · 26/2/2017 · 1 minute

Conrad Ludgate

Visit the full article: Postgres | Posts · 5/11/2023

Why I added this Repository/Article/Blog/PR?

I've always run the basic postgres docker image with no backups or replicas configured. Since I have a new cluster now, I thought I should try something new. I recently read about CloudNativePG on HN so I decided to look into it. It got high praise from the replies, which is quite remarkable for HN.

It seems to have all the features I would want from a 'managed' postgres:

Managed backups
Easily create a new database
Manage secrets

SQL Join

Alibaba Cloud Community

Visit the Blog Post titled How to Write a High-Performance SQL Join: Implementation and Best Practices of Joins

Xuan-Son Nguyen

Realcat Vincentqyw(https://github.com/Vincentqyw) starred a repository on 13/5/25
ngxson/smolvlm-realtime-webcam (HTML) 3.2k STARS

Visit the full article: Easier to Understand: Natural Language Processing | Posts · 10/2/2024 · 5 minutes

Why I added this Repository/Article/Blog/PR?

In fact, convolutional neural networks work quite well with images, because in the worst case, you can cut the image to a certain size. For example, creating a model to recognize handwritten digits (MNIST dataset) is one of the very typical and easy-to-experiment exercises for newcomers to machine learning.

Guillaume Guy

Overview of commits/PRs from Apr 1, 2025 to Apr 30, 2025

Backend URL Link https://github.com/rwightman?tab=overview&from=2025-04-01&to=2025-04-30

Initial work on adding local-dir: schema for model & tokenizer loading from local folder#1069

rwightman

Here are the details of a specific commit from the mlfoundations/open_clip repository:

GG - guillaumeguy

FYI: Looks good to me anecdotally:

Visit the full article: Don't use raw embeddings | Posts · 16/4/2025 · 3 minutes

Why I added this Repository/Article/Blog/PR?

However, embeddings are still quite large. OpenAI's text-embedding-3-large can reach up to d=3072, which means 12kB (stored as float32) per entity. From experience, this is enough to overwhelm SQL engines when performing large JOINs, as this data needs to be sent across the network for a distributed JOIN.
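The per-entity storage cost is the embedding dimension multiplied by the bytes per value. A minimal sketch of that arithmetic for d=3072 at two common precisions (`embedding_bytes` is a hypothetical helper for illustration):

```python
def embedding_bytes(dim, bytes_per_value):
    """Raw size in bytes of one dense embedding vector."""
    return dim * bytes_per_value


float32_size = embedding_bytes(3072, 4)  # float32 uses 4 bytes per value
float16_size = embedding_bytes(3072, 2)  # float16 uses 2 bytes per value
print(float32_size, float16_size)  # 12288 6144
```

At a million entities, the float32 figure comes to roughly 12 GB of raw vector data that a distributed JOIN may have to move.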

Hrishi Olickel

Hi there! I’m Hrishi Olickel.

Visit the full article: Subqueries and CTEs: an example of query optimization in Postgres | Optimization isn't always premature | Posts · 9/10/2020 · 5 minutes

Anton Zhiyanov

Hi there! I’m Anton Zhiyanov.

Visit the full article: SQL join flavors | Posts · 20/6/20203 · 5 minutes

codapi — Interactive code examples for all types of technical writing.
redka — Redis re-implemented with SQLite.
sqlean — SQLite extensions.

James (@capjamesg)

Hi there! I’m James (@capjamesg).

Visit the full article: Building a NoSQL database in Python | Posts · 19/8/2024 · 5 minutes

Zongheng Yang (@concretevitamin)

Hi there! I’m Zongheng Yang (@concretevitamin).

Followings of Kevin Frans on GitHub

gmittal Gautam Mittal

126 repositories · 226 followers

Gautam Mittal (https://github.com/gmittal) has a repository
skypilot-org/skypilot (Python) 8.2k STARS

Contributor Rankings

#1 Michaelvll - 985 commits | GitHub Profile
#2 Zongheng Yang - 531 commits | GitHub Profile

Visit the full article: SQL Query Optimization Meets Deep Reinforcement Learning | Posts · 18/9/2018

Chris Done (@chrisdone)

Hi there! I’m Chris Done (@chrisdone).

Artin Ghasivand(https://github.com/Ei30metry) followed a GitHub user on 27/6/24
https://github.com/chrisdone

Artificial Labs @artificialio

Visit the full article: Fast pagination on PostgreSQL | Posts · 19/11/2014 · 2 minutes

Jayesh Sharma (@wjayesh)

Hi there! I’m Jayesh Sharma (@wjayesh).

Martin Krasser(https://github.com/krasserm) starred a repository on 26/3/25
wjayesh/mahilo (Python) 360 STARS

Visit the full article: Building distributed apps using Dapr, locally and on Azure | Posts · 17/3/2021 · .Net Programming, Microservices · 10 minutes

📑 Latest Blog Posts

Reshama Shaikh (@reshamas)

Hi there! I’m Reshama Shaikh (@reshamas).

Visit the full article: Fastai Week 2 Classifying African And Asian Elephants | Posts · 5/11/2018 · 4 minutes

Visit the full article: Fastai Week 1 Classifying Camels Horses And Elephants | Posts · 28/10/2018 · 5 minutes

Visit the full article: My First Kaggle Competition | Posts · 18/4/2018 · 5 minutes

Gérald Barré (@meziantou)

Hi there! I’m Gérald Barré (@meziantou).

📗 Recent blog posts

Alexandre Mutel (https://github.com/xoofx) followed a GitHub user on 2/5/25
meziantou

Visit the full article: Prevent accidental disclosure of configuration secrets | Posts · 13/2/2023 · 4 minutes

Visit the full article: Replace characters in a string using Vectorization | Posts · 11/7/2022 · 4 minutes

Books

Michael Tarlton

Hi there! I’m Michael Tarlton.

Michael Tarlton

@MichaeTa

Just re-illustrating the example from the Russell book Chapter 21. Note how the unit “numbers” have changed. Give it a shot if you have literally nothing else to do. There is a reason we make computers do this.

Visit the Wikipedia Page: Artificial Intelligence: A Modern Approach | Written by Stuart J. Russell and Peter Norvig

AIMA has been called "the most popular artificial intelligence textbook in the world",^[2] and is considered the standard text in the field of AI.^[3]^[4] As of 2023, it was being used at over 1500 universities worldwide,^[5] and it has over 59,000 citations on Google Scholar.^[6]

llama Public

llama -- A CLI for outsourcing computation to AWS Lambda.

595 24

Overview of commits/PRs from Jul 1, 2018 to Jul 31, 2018

Backend URL Link https://github.com/mrocklin?tab=overview&from=2018-07-01&to=2018-07-31

Serialization of data within a tensor is slow #9168

mrocklin

Here are the details of a specific PR from the pytorch/pytorch repository:

Contributor Rankings

#1 SkalskiP - 40 commits | GitHub Profile

Ido Shamun

Visit the full article: SQL Join vs Subquery: The Game Changer | Posts · 19/11/2020 · 3 minutes

Su Yang

Visit the full article: Intel B580 GPU Large Model Container Inference Practice: A Case Study of DeepSeek R1 Distill Qwen 7B (Part 1) | Posts · 7/2/2025 · 25 minutes

Asankhaya Sharma

Asankhaya Sharma(https://github.com/codelion)
codelion launched their sponsorship page 💖 Asankhaya Sharma codelion on 10/6/25

Asankhaya Sharma(https://github.com/codelion) Trending repositories on 23/5/25
codelion/openevolve (Python) 2.3k STARS

Max Liani

Followings of @h3r2tic on X

Jiayin Cao

@Jiayin_Cao

Followings of @Jiayin_Cao on X

Max Liani

@maxliani

Raytracing Director at Nvidia. Previously: Tech Lead for RenderMan at Pixar, Architect of Glimpse Renderer at Animal Logic Views are my own.

Visit the full article: DNND 1: a Deep Neural Network Dive | Posts · 27/3/2023 · Software Development · 5 minutes

Marlene

Pythonista and Developer Advocate at Microsoft 🥑✨ Learning and teaching Python🐍

Anthony Shaw(https://github.com/tonybaloney) Created a pull request on 2/6/25
langchain-ai/langchain-community/pull/88

Overview of commits/PRs from Jun 1, 2025 to Jun 30, 2025

Backend URL Link https://github.com/tonybaloney?tab=overview&from=2025-06-01&to=2025-06-30

Harden Azure ML url validation#88

tonybaloney

Here are the details of a specific PR from the langchain-ai/langchain-community repository:

Visit the full article: An Introduction to Ibis for Python Programmers | A More Pythonic Way To Work With Databases | Posts · 14/3/2022 · Python, Databases · 10 minutes

Edzer Pebesma

geoinformatics, spatial statistics, R.

Institute for Geoinformatics, Universität Münster
Münster, Germany

Visit the full article: Setting up large scale OSM environments for R using Osmosis and PostgreSQL with PostGIS | Posts · 14/6/2017 · Databases, geoinformatics, spatial statistics, R. · 5 minutes

Microsoft MVP

Gérald Barré (@meziantou)

Hi there! I’m Gérald Barré (@meziantou).

Anthony Shaw(@tonybaloney)

Anthony Shaw(https://github.com/tonybaloney) contributed to
Azure-Samples/eShopLite on 31/5/25

Anthony Shaw(https://github.com/tonybaloney) contributed to
langchain-ai/langchain-azure - langchain-ai/langchain-azure/pull/99 on 10/6/25

Overview of commits/PRs from Jun 1, 2025 to Jun 30, 2025

Backend URL Link https://github.com/tonybaloney?tab=overview&from=2025-06-01&to=2025-06-30

Replace MD5 with SHA256 for cache index entry keys and names#99

tonybaloney

Here are the details of a specific PR from the langchain-ai/langchain-azure repository:

Why I added this Repository/Article/Blog/PR?

MD5 and SHA1 should never be used for cache keys because there is a chance of collisions.

The implication here is that the in-process cache dictionary will use different cache keys, but that doesn't matter since it's stored in memory and you need to restart to run this update.
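The change described above can be sketched with Python's standard `hashlib`. This is an illustration under assumptions, not the code from the PR: the `cache_key` helper and the `cache` prefix are hypothetical.

```python
import hashlib


def cache_key(text, prefix="cache"):
    """Derive a cache key from text using SHA-256 instead of MD5."""
    digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
    return f"{prefix}:{digest}"


print(cache_key("What is the capital of France?"))
```

Keys derived this way differ from any MD5-based keys already stored, which is why entries written before such a change would no longer be found afterwards.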

Jirka Borovec(@Borda)

Jirka Borovec(https://github.com/Borda) contributed to
Lightning-AI/lightning-thunder/ on 10/6/25

Overview of commits/PRs from Jun 1, 2025 to Jun 30, 2025

Backend URL Link https://github.com/Borda?tab=overview&from=2025-06-01&to=2025-06-30

fixed installing NCCL for CUDA#2208

Borda

Here are the details of a specific PR (#2208) from the Lightning-AI/lightning-thunder repository:

Ethan Harris(@ethanwharris)

Code for our paper "FMix: Enhancing Mixed Sample Data Augmentation"
Used by the second place team in the Bengali.AI Handwritten Grapheme Classification Kaggle competition and by the third place team in the Rainforest Connection Species Audio Detection Kaggle competition

Visit the full article: Bengali.AI Handwritten Grapheme Classification | Classify the components of handwritten Bengali | Discussion · Bengali.AI · Research Code Competition · 17/3/2020 · 10 minutes

David Pine(@IEvangelist)

Anthony Shaw(https://github.com/tonybaloney) contributed to
Azure-Samples/eShopLite on 31/5/25

Followings of El Bruno(@elbruno) on GitHub

Bohdan Stupak(@Wkalmar)

Visit the full article: Improve C# code performance with Span<T> | Posts · 24/3/2025 · 9 minutes

Friedrich von Never(@ForNeVeR)

Alexandre Mutel (https://github.com/xoofx) followed Friedrich von Never (https://github.com/ForNeVeR) on 21/2/25.

Visit the full article: Code Vectorization in .NET and Other Technologies | Posts · 32/10/2023 · 5 minutes

Visit the full article: A variant of a diff algorithm for constrained conditions | Posts · 12/2/2021 · 5 minutes

Will DePue(@0hq)

Visit the full news article: Meet Girl Who Mastered Coding At 11, Built Rs 1,00,00,00,000 Crore Startup At Just 16— She Is… | News Article · 15/6/2025 · AI · 5 minutes

Followings of @raidingAI on X

Pranjali Awasthi

@raidingAI

Visit the GitHub repository: A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!) | GitHub Repository · 12/6/2023 · the tiny, least-dumb, speedy vector embedding database · 5 minutes

Visit the full article: The Missing WHERE Clause in Vector Search | Posts · 30/6/2023 · the tiny, least-dumb, speedy vector embedding database · 5 minutes

Adam Sitnik(@adamsitnik)

Open Source contributor, #BenchmarkDotNet maintainer. My job on .NET Team is to make the .NET the fastest developer platform on the planet.

Visit the full article: Span | Posts · 13/7/2017 · Span<T> · 5 minutes

Ash Vardanian(@ashvardanian)

Matt DesLauriers (https://github.com/mattdesl) starred a repository on 17/6/25
https://github.com/mattdesl/canvas-dimensions (JavaScript) 32 STARS

Created an issue in unum-cloud/uform that received 2 comments on 9/6/25

Jimmy Lefevre(@JimmyLefevre)

Jimmy Lefevre - LinkedIn

Passionate about technology, I am particularly committed to training myself and understanding new development best practices. From web to software, including mobile applications and IoT, I put my skills to the service of innovation, but also of transmission by giving courses at CESI in Dijon, Lille, and Nancy, and by sharing my knowledge with my colleagues whenever possible.

Alexandre Mutel (https://github.com/xoofx) starred a repository on 20/6/25
JimmyLefevre/kb (C) 275 STARS

kb Public

kb single-header C/C++ libraries.

279 1

Why I added this Repository/Article/Blog/PR?

kb_text_shape.h

kb_text_shape.h provides ICU-like text segmentation (i.e. breaking Unicode text by direction, line, word and grapheme). It also provides Harfbuzz-like text shaping for OpenType fonts, which means it is capable of handling complex script layout and ligatures, among other things.

Shay Rojansky(@roji)

Wes Doyle(@wesdoyle) (https://github.com/wesdoyle) followed a GitHub user on 4/1/24
roji

Accordion

Shay Rojansky

Microsoft software engineer working on .NET data access and perf, member of the Entity Framework team. Lead dev of Npgsql, the PostgreSQL provider.

Queryable PostgreSQL arrays in EF Core 8.0

6 minute read

Queryable collections?

When "UTC everywhere" isn't enough - storing time zones in PostgreSQL and SQL Server

8 minute read

When "UTC everywhere" isn't enough

Query parameters, batching and SQL rewriting

7 minute read

In the upcoming version 6.0 of the Npgsql PostgreSQL driver for .NET, we implemented what I think of as "raw mode" (#3852). In a nutshell, this means that you can now use Npgsql without it doing anything to the SQL you provide it - it will simply send your queries as-is to PostgreSQL, without parsing or rewriting them in any way.
