Matthew Rocklin

Working Notes from Matthew Rocklin

  • New Blog and RSS Feed: 03 Jan 2023
  • Startup Revenue: 25 Jul 2022
  • SciPy Mission -- What's next?: 19 Jul 2022
  • Small Scope and Fast Review: 26 Mar 2022
  • Seven Stages of Ownership: 18 Feb 2022
  • Admirable Traits of Tech Leads: 24 Jun 2021
  • Where are you based?: 28 Feb 2021
  • Employee Benefits: 11 Feb 2021
  • Personal COVID Risk: 14 Jan 2021
  • Hockey Stick growth and Github Stars: 28 Oct 2020
  • My Ideal Tweet: 18 Aug 2020
  • Write to a short attention span: 13 Jul 2020
  • I'm Founding a Dask Company: 08 Jan 2020
  • Sales is about listening: 05 Dec 2019
  • What is a Senior Engineer?: 09 Nov 2019
  • Reasons to keep your on-site HPC center: 01 Oct 2019
  • Tips for Interactive HPC: 01 Oct 2019
  • Non-Technical roles in OSS: 28 Aug 2019
  • HTML outputs in Jupyter: 04 Jul 2019
  • Write Short Blogposts: 25 Jun 2019
  • Avoid Indirection in Code: 23 Jun 2019
  • The Role of a Maintainer: 18 May 2019
  • Why I Avoid Slack: 28 Feb 2019
  • Dask, Pandas, and GPUs: first steps: 13 Jan 2019
  • GPU Dask Arrays, first steps: 03 Jan 2019
  • First Impressions of GPUs and PyData: 17 Dec 2018
  • Support Python 2 with Cython: 28 Nov 2018
  • Anatomy of an OSS Institutional Visit: 27 Nov 2018
  • Joining NVidia: 26 Nov 2018
  • So you want to contribute to open source: 12 Oct 2018
  • Dask Development Log: 17 Sep 2018
  • Dask Release 0.19.0: 05 Sep 2018
  • High level performance of Pandas, Dask, Spark, and Arrow: 28 Aug 2018
  • Public Institutions and Open Source Software: 21 Aug 2018
  • Cloud Lock-in and Open Standards: 19 Aug 2018
  • Building SAGA optimization for Dask arrays: 07 Aug 2018
  • Dask Development Log: 02 Aug 2018
  • Pickle isn't slow, it's a protocol: 23 Jul 2018
  • Dask Development Log, Scipy 2018: 17 Jul 2018
  • Who uses Dask?: 16 Jul 2018
  • Dask Development Log: 08 Jul 2018
  • Dask Scaling Limits: 26 Jun 2018
  • Dask Release 0.18.0: 14 Jun 2018
  • Beyond Numpy Arrays in Python: 27 May 2018
  • Dask Release 0.17.2: 21 Mar 2018
  • Summer Student Projects 2018: 20 Mar 2018
  • Craft Minimal Bug Reports: 28 Feb 2018
  • Dask Release 0.17.0: 12 Feb 2018
  • Credit Modeling with Dask: 09 Feb 2018
  • HDF in the Cloud: 06 Feb 2018
  • The Case for Numba in Community Code: 30 Jan 2018
  • Write Dumb Code: 27 Jan 2018
  • Pangeo: JupyterHub, Dask, and XArray on the Cloud: 22 Jan 2018
  • Dask Development Log: 06 Dec 2017
  • Dask Release 0.16.0: 21 Nov 2017
  • Optimizing Data Structure Access in Python: 03 Nov 2017
  • Streaming Dataframes: 16 Oct 2017
  • Notes on Kafka in Python: 10 Oct 2017
  • Dask Release 0.15.3: 24 Sep 2017
  • Fast GeoSpatial Analysis in Python: 21 Sep 2017
  • Dask on HPC - Initial Work: 18 Sep 2017
  • Dask Release 0.15.2: 30 Aug 2017
  • Scikit-Image and Dask Performance: 18 Jul 2017
  • Dask Benchmarks: 03 Jul 2017
  • Use Apache Parquet: 28 Jun 2017
  • Programmatic Bokeh Servers: 28 Jun 2017
  • Dask Release 0.15.0: 15 Jun 2017
  • Dask Release 0.14.3: 08 May 2017
  • Dask Development Log: 28 Apr 2017
  • Asynchronous Optimization Algorithms with Dask: 19 Apr 2017
  • Streaming Python Prototype: 13 Apr 2017
  • Dask and Pandas and XGBoost: 28 Mar 2017
  • Dask Release 0.14.1: 23 Mar 2017
  • Developing Convex Optimization Algorithms in Dask: 22 Mar 2017
  • Biased Benchmarks: 09 Mar 2017
  • Dask Release 0.14.0: 27 Feb 2017
  • Dask Development Log: 20 Feb 2017
  • Experiment with Dask and TensorFlow: 11 Feb 2017
  • Two Easy Ways to Use Scikit Learn and Dask: 07 Feb 2017
  • Dask Development Log: 30 Jan 2017
  • Custom Parallel Algorithms on a Cluster with Dask: 24 Jan 2017
  • Dask Development Log: 18 Jan 2017
  • Distributed NumPy on a Cluster with Dask Arrays: 17 Jan 2017
  • Distributed Pandas on a Cluster with Dask DataFrames: 12 Jan 2017
  • Dask Release 0.13.0: 03 Jan 2017
  • Dask Development Log: 24 Dec 2016
  • Dask Development Log: 18 Dec 2016
  • Dask Development Log: 12 Dec 2016
  • Dask Development Log: 05 Dec 2016
  • Dask Cluster Deployments: 22 Sep 2016
  • Dask and Celery: 13 Sep 2016
  • Where to Write Prose?: 12 Sep 2016
  • Dask Distributed Release 1.13.0: 12 Sep 2016
  • Supporting Users in Open Source: 25 Aug 2016
  • Dask for Institutions: 16 Aug 2016
  • Dask and Scikit-Learn -- Model Parallelism: 12 Jul 2016
  • Ad Hoc Distributed Random Forests: 20 Apr 2016
  • Fast Message Serialization: 14 Apr 2016
  • Distributed Dask Arrays: 26 Feb 2016
  • Pandas on HDFS with Dask Dataframes: 22 Feb 2016
  • Introducing Dask distributed: 17 Feb 2016
  • Write tests: 08 Feb 2016
  • Disk Bandwidth: 29 Dec 2015
  • Data Bandwidth: 29 Dec 2015
  • Dask is one year old: 21 Dec 2015
  • Distributed Prototype: 09 Oct 2015
  • Efficient Tabular Storage: 28 Aug 2015
  • A Weekend with Asyncio: 10 Aug 2015
  • Caching: 03 Aug 2015
  • Custom Parallel Workflows: 23 Jul 2015
  • Write Complex Parallel Algorithms: 26 Jun 2015
  • Distributed Scheduling: 23 Jun 2015
  • Pandas Categoricals: 18 Jun 2015
  • State of Dask: 19 May 2015
  • Profiling Data Throughput: 21 Apr 2015
  • Partition and Shuffle: 25 Mar 2015
  • Efficiently Store Pandas DataFrames: 16 Mar 2015
  • Towards Out-of-core DataFrames: 11 Mar 2015
  • PyData and the GIL: 10 Mar 2015
  • Ising models and Numba: 28 Feb 2015
  • Towards Out-of-core ND-Arrays -- Dask + Toolz = Bag: 17 Feb 2015
  • Towards Out-of-core ND-Arrays -- Slicing and Stacking: 13 Feb 2015
  • Into and Remote Data: 11 Feb 2015
  • ReIntroducing Into: 03 Feb 2015
  • Towards Out-of-core ND-Arrays -- Spilling to Disk: 16 Jan 2015
  • Towards Out-of-core ND-Arrays -- Benchmark MatMul: 14 Jan 2015
  • Towards Out-of-core ND-Arrays -- Multi-core Scheduling: 06 Jan 2015
  • Towards Out-of-core ND-Arrays -- Frontend: 30 Dec 2014
  • Towards Out-of-core ND-Arrays: 27 Dec 2014
  • Blaze Datasets: 19 Nov 2014
  • Introducing Blaze - Migrations: 01 Sep 2014
  • Introducing Blaze - Practice: 01 Sep 2014
  • Introducing Blaze - Expressions: 01 Sep 2014
  • Streaming Analytics: 04 Jul 2014
  • Introducing CyToolz: 01 May 2014
  • Python Data Structures are Fast: 01 May 2014
  • Multiple Dispatch: 25 Feb 2014
  • Python v. Clojure v. Julia: 13 Jan 2014
  • Parallelism and Serialization: 05 Dec 2013
  • Wordcounting and Verbosity: 15 Nov 2013
  • Introducing PyToolz: 17 Oct 2013
  • How I Test Python: 15 Sep 2013
  • Dictionaries v. Objects: 02 Sep 2013
  • Thread First Pattern: 30 Aug 2013
  • Using SymPy within Theano: 14 Aug 2013
  • Packages Considered Slightly Harmful: 02 Jul 2013
  • GroupBy and Package Management: 21 May 2013
  • SymPy and Theano -- Matrix Expressions: 05 Apr 2013
  • SymPy and Theano -- Scalar Simplification: 28 Mar 2013
  • SymPy and Theano -- Code Generation: 19 Mar 2013
  • Operation Ordering in MatLab: 26 Feb 2013
  • Maximum a Posteriori Estimation: 25 Feb 2013
  • Assuming assumptions: 05 Feb 2013
  • Commutative Unification: 25 Jan 2013
  • LogPy - Facts and Relations: 17 Jan 2013
  • Introducing LogPy: 14 Jan 2013
  • Statistical Simplification: 11 Dec 2012
  • Characteristic Functions: 03 Dec 2012
  • Computing the Kalman Filter: 24 Nov 2012
  • Building Computations: 23 Nov 2012
  • Computations: 21 Nov 2012
  • Preliminary BLAS Results: 10 Nov 2012
  • Branching Strategies: 09 Nov 2012
  • Strategies: 07 Nov 2012
  • Unification in SymPy: 01 Nov 2012
  • Matrix Computations in SymPy: 29 Oct 2012

© Matthew Rocklin 2020.