Why Should We Always Set dtype When We Use Pandas?
Recently, we were planning to use a more compact and convenient file format to store our data in the cloud. When we tested pandas with Parquet, we encountered a strange and difficult bug.
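The excerpt above does not spell out the bug, so the following is only a general sketch of what setting dtype explicitly looks like, not a reproduction of our case. The column names `zip` and `count` and the sample rows are made up for illustration. It shows one well-known way type inference can go wrong: a code with a leading zero is read as an integer unless we say otherwise.

```python
import io

import pandas as pd

# Hypothetical sample data: "zip" is an identifier, not a number.
raw = "zip,count\n01234,5\n98765,7\n"

# Without dtype, pandas infers "zip" as an integer and drops the leading zero.
inferred = pd.read_csv(io.StringIO(raw))

# With dtype, each column is read as the type we declare.
typed = pd.read_csv(io.StringIO(raw), dtype={"zip": str, "count": "int32"})

print(inferred["zip"].iloc[0])  # the leading zero is gone
print(typed["zip"].iloc[0])     # the original text is preserved
```

Declaring types up front also keeps the schema stable when the same data is later written to a typed format such as Parquet.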
A Strange Problem Caused by Indexing in pandas.read_csv
Yesterday, we encountered a very strange bug caused by the indexing mechanism in the pandas.read_csv function.
Using Chunksize in Pandas
pandas is an efficient tool for processing data, but when the dataset does not fit in memory, using pandas can be a little tricky.
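As a minimal sketch of the idea, assuming a CSV with a single numeric column named `value` (a made-up example, with a small in-memory string standing in for a large file), passing `chunksize` to `pandas.read_csv` yields the data piece by piece, so an aggregate can be computed without loading everything at once.

```python
import io

import pandas as pd

# Hypothetical stand-in for a file that is too large to load in one go.
csv_data = io.StringIO("value\n" + "\n".join(str(i) for i in range(10)))

total = 0
rows = 0

# With chunksize set, read_csv returns an iterator of DataFrames,
# each holding at most 4 rows here.
for chunk in pd.read_csv(csv_data, chunksize=4):
    total += chunk["value"].sum()
    rows += len(chunk)

# Combine the per-chunk results into the overall mean.
mean_value = total / rows
print(mean_value)
```

The tricky part is that only operations that can be combined across chunks (sums, counts, filters) work this simply; anything that needs the whole dataset at once, such as a global sort, needs more care.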
Using Pandas as a Unified IO Tool
When I wrote my dissertation, I used Pandoc to convert my draft in markdown format to the final version in PDF format. Pandoc is an extremely powerful and easy-to-use tool for file conversion, as shown in the following diagram.