Skills & Certifications
SKILLS
- Database Architectures (Pipelines, Lakes, Warehouses)
- Database Design (Models, Schemas, Star, Snowflake, OLTP, OLAP, MPP),
- Data Ingestion & Processing (ETL,ELT, EST, scraping),
- Analysis & Visualization ( Tableau, Seaborn)
- Database Management DBMS (POSTREGSQL, MySQL, MongoDB & Cassandra),
- Parallel & Cluster Computing (Apache Pyspark, Hive, Kafka, Hbase)
- Automation & Containerization (PyautoGUI, Airflow, DAGS, workflow, Docker, Kubernetes)
- Cloud Computing;- AWS(boto3, S3, EMR, Athena, EC2, Elastic search. Kinesis, Quicksight, Redshift, DynamoDB, Hbase, RDS, Lex, Poly, Sagemaker, rekognition, Mxnet, Lambda, direct connect & gateway) - GCP ( cloud storage, compute engine, dataproc, SQL)
- Programming Language (Python, SQL, BASH & Scala (on Java VM))
- CI & CD (Git, Pytest, Circle CI, Travis CI)
- Agile & Scrum Environment…
CERTIFICATIONS
- Data Engineering on Google Cloud (GCP) - PLURALSIGHT
- Big Data on Amazon Web Service (AWS) - UDEMY
- Microsoft Azure Data Engineer - PLURALSIGHT
- Data Engineering NanoDegree - UDACITY
- Data Engineer with Python – DATACAMP
- Data Analyst with SQL server – DATACAMP
- Advanced SQL – KAGGLE
- Machine Learning Explainability – KAGGLE
- Feature Engineering – KAGGLE