Plurrrr

Mon 15 Jun 2020

Debugging out-of-memory crashes in Python

You run your program, and it crashes—it’s out of memory:

  • If you’re lucky, you get a MemoryError exception.
  • If you’re less lucky, you get a coredump.
  • If you’re having a bad day, your computer locks up and you need to restart it.

How do you figure out what is using up all your Python program’s memory?

Source: Debugging out-of-memory crashes in Python, an article by Itamar Turner-Trauring.
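
One possible starting point for that question is the standard library's tracemalloc module. This is a minimal sketch and is not taken from the linked article. The helper top_allocations and the workload build_big_list are hypothetical names used only for illustration. The sketch runs a function while tracing is active and reports the source lines that hold the most memory.

```python
import tracemalloc


def top_allocations(workload, limit=5):
    """Run workload() under tracemalloc and return the largest allocation sites.

    `workload` is any zero-argument callable (an assumption for this sketch).
    """
    tracemalloc.start()
    try:
        # Keep a reference so the allocated objects are still alive
        # when the snapshot is taken.
        result = workload()
        snapshot = tracemalloc.take_snapshot()
    finally:
        tracemalloc.stop()
    del result
    # Statistics grouped by source line, sorted from largest to smallest.
    return snapshot.statistics("lineno")[:limit]


def build_big_list():
    # Hypothetical memory-hungry workload: 10,000 byte strings of 1,000 bytes.
    return [bytes(1000) for _ in range(10_000)]


if __name__ == "__main__":
    for stat in top_allocations(build_big_list):
        print(stat)
```

Each printed entry names a file and line along with the total size and number of blocks allocated there. Note that tracemalloc only sees memory allocated through Python's allocators, and tracing adds overhead, so this may not help with a crash caused by a C extension or when the process is killed before a snapshot can be taken.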

Make Your Old Documents Searchable with Bash, OCR, and pdfgrep

I've been interested in the world of economic zones for about 10 years. In that time, I've accumulated an enormous amount of research.

Unfortunately, these documents are a mess. Many are old .doc files or image-scan PDFs of pamphlets from the '80s.

It's difficult to search these individually and especially difficult to query across the entire library.

Here's the workflow I used to make this library searchable.

Source: Make Your Old Documents Searchable with Bash, OCR, and PDFGrep, an article by Zach Caceres.