Skills

  • Data science programming languages: Python, R, shell scripting, SQL, Scala

  • Data visualization: Matplotlib, seaborn, Plotly, ggplot

  • Data processing frameworks: Apache Spark, Hadoop, pandas, NumPy, SciPy

  • Machine learning toolset: scikit-learn, Spark ML, Keras, Hyperopt, MLflow, SHAP, LIME, DoWhy, H2O, Snorkel, NLTK

  • Databases: MySQL, PostgreSQL, DynamoDB, Presto/Hive, BigQuery

  • Cloud platforms: AWS, GCP

  • Data platforms: H2O, Domino, Databricks

  • Models used: deep neural networks, decision trees, linear regression, logistic regression, random forests, XGBoost, etc.