Technology
-
Working with Python Project Directories - March 05, 2020
This short post contains some tips and tricks for dealing with complicated Python project directories in an organized way.
-
Working with Badly Nested Data in Spark - May 11, 2019
Apache Spark is a distributed computing platform which can handle almost any kind of data you throw at it. But it has lots of optimizations that require SQL-like tables with consistent schemas. This post includes some of what I have learned about taking advantage of these optimizations when the data has an inconsistent, deeply nested schema.