Doing a major version upgrade of the main library never sounds like fun, especially if you haven’t touched the code for a while. In this post I talk about things that authors of libraries can do to make upgrades easy for their users.
Tagged in ‘python’
DataFrames are a great abstraction for working with structured and semi-structured data. Recently they were introduced in Spark and made large scale data science much easier. How do those new, shiny, distributed Spark DataFrames compare to Pandas, established single-machine tool for data analysis? Let’s find out!