Working on various Python projects has taught me the importance of consistent logging, especially when dealing with distributed computing frameworks like Spark. Logging is not just about keeping track of errors or information; it’s about having a detailed and systematic record of the operations to understand the flow of your program and quickly diagnose issues.… Continue reading My Python logging setup
Classifying fruits with a Convolutional Neural Network in Keras
I followed a tutorial on Convolutional Neural Networks that left many questions unanswered. I soon realized that the actual process of architecting a neural network and setting its parameters was much more experimental than I had thought. It took a while to find explanations that a rookie like me could understand. Most of the… Continue reading Classifying fruits with a Convolutional Neural Network in Keras
10 things I learned while deploying my first python function to AWS Lambda
I spent a few days on and off trying to deploy a Flask REST service to AWS Lambda, just to experience what the cool kids were talking about. These are some of the things I learned along the way: Zappa is the easiest packager/deployer for Python (as of December 2018). Zappa provides good quality… Continue reading 10 things I learned while deploying my first python function to AWS Lambda
Uploading and downloading documents from Amazon S3 using bash
You need to upload a file to S3 and cannot install new packages on the server, not even the S3 client tools. You only have bash, openssl and sed. Go. I found and adapted a script by Viktor Szakats that creates all the proper headers expected by the latest AWS API. I had lots of trouble with… Continue reading Uploading and downloading documents from Amazon S3 using bash
Readability scoring of the United Nations Corpus
Imagine you could estimate how hard it would be to read a document before reading it. Imagine you could do it for entire batches of documents you need to process. Imagine you could have a recommender system that would help you prioritize unread documents according to their difficulty. A bit of experimentation with the public United… Continue reading Readability scoring of the United Nations Corpus