I work as a Data Developer, designing and developing solutions to complex business problems with data as the key element. My role involves creating data-driven products and platforms, integrating them with various data sources, managing the data securely, and using it to build and run ML models. I have mostly used the Hadoop technology stack and Apache Spark running on cloud infrastructure to build these products.
I have experience writing data pipelines and ETLs, and a background in Business Intelligence.
I have worked extensively on AWS Cloud with big data on S3 and Lambda architectures, using AWS S3, Lambda, and SNS.
I am proficient in Java, Scala, and Python and have experience writing production-ready code in these languages. That said, I am always ready to learn new technologies and apply them to real-world problems whose solutions have the potential to help humanity.
My open source projects on GitHub: https://github.com/anish749/
Spark Package for reading Adobe Site Catalyst data:
Geospatial location search in Spark
MapReduce-based validation framework: