Skills
Data science Programming languages: Python, R, Shell-scripting, SQL,Scala
Data Visualization: Matplotlib, seaborn Plotly, GGplot
Data processing frameworks: Apache Spark, Hadoop, Pandas, Numpy, Scipy
Machine learning tool-set: Scikit-learn, Spark.ml, Keras, HyperOpt, Mlflow, SHAP, Lime , DoWhy, H2O,snorkel, nltk
Database: MySQL, Postres, Dynamodb, Presto/Hive, Bigquery
Cloud platform: AWS, GCP
Data Platforms: H2O, Domino, Databricks
Models used: Deep learning neural nets, Decision Trees, Linear regression , Logistic regression, Random forest, xg-boost etc
|