Description
- Strong experience with Big data, ApacheKafka
- Identify the most appropriate data sources to use for a given purpose and understand their structures and contents, in collaboration with SMEs when required.
- Extract structured and unstructured data from the source systems (relational databases, data warehouses, document repositories, file systems), prepare such data (cleanse, re-structure, aggregate, ) and load them onto Hadoop.
- Actively support reporting teams in the data exploration and data preparation phases. Where data quality issues are detected, liaise with the data supplier to do root cause analysis
- Contribute to the design, build and launch activities