ETL lessons learnt
The search team at forward does a lot of ETL, data is our daily business.
Recently I wrote a script that:
- Collects some data from our Hadoop cluster
- Calls a 3rd party API via HTTP
- Pools the 3rd party API waiting for the requests to be processed
- Downloads and stores locally the result of the 3rd party call