Machine Learning & AI Scientist :: IIT Grad + 12 Yrs Exp + 75 Clients

Graduation: not provided

Hourly-/Daily rates: show

Languages: English (Full Professional) | Hindi (Native or Bilingual)

Keywords

Apache Python Big Data Analysis statistical analysis Database Design Data Warehousing Data Warehouse Business Intelligence Microstrategy Informatica Talend Tableau Qlikview power bi Data Science Machine Learning neural networks Reinforcement plans Algorithm development Apache Spark Data Integration etl Cassandra apache hbase Time Series Data Bases Classification regression Cluster Predictive Analysis Data Modeling

Skills

-Big Data -Analytics -Deep Learning -NLP -Python -Spark -TensorFlow -BI

Short Bio
Be extremely clear with your requirements and 5Es are sufficient to shortlist a service provider. Rest are stories.

EDUCATION: Indian Institute of Technology, IIT. One of the top technical universities worldwide.
EXPERIENCE: 12 Years. Hands-on experience working on challenging problems on data-to-decision.
EXPERTISE: Data-to-Decision. Big Data, Data Science, AI, Machine Learning, Analytics, BI, Web.
EXPOSURE: Served 75 large clients. Marketing, Sales, Supply, Growth, Performance - All industries.
EMPATHY: Make your life easy. Work as individual and team (15 fulltime experts) with commitments.

Seeing is Believing: https://github[.]com/h2hdata

Long Bio
Pranav is a data person helping enterprise, small business, and startups solving their data problems at scale. Graduate from Indian Institute of Technology, IIT Guwhati, carries 12 years of significant experience working on data engineering and data science. Pranav had been associated with esteemed businesses including Invitation Homes, Kaiser Permanente, Zurich Financial Services, Hughes Depot Supply, Boeing, Nielsen, Adap TV, Ecolab, CNA Financial, to name very few in list.

OFFERING
1. Data Ingestion
- Ingesting data from variety of sources to a big data ecosystem.
- Data sources can be operational systems (CRM, ERP, SCM, CMS), Databases (Relational, NO SQL), Streaming (Machine Logs, Social Media, Event Processing), Files (Word, Excel, CSV, PDF, Text)
- Big Data ecosystem can be Apache Hadoop or Apache Spark

2. Data Scale
- Moving data to a big data ecosystem
- Parallel processing through Apache Spark
- Storing data to NO SQL databases for pre-processing

3. Data Wrangling
- Data Lake development including data pipelines
- Pre-processing tasks including data parsing, data profiling, data shaping, data inferencing, data enrichment, data harmonization, data cleansing, data transformation, data combining

4. Data Context
- Natural Language Processing of un-structured and multi-structured data Entity selection, Featurization, Part-of-Speech, Lemmatization, Stemming, TF- IDF, Scoring - Entity Resolution, Linked Data, Semantic Web, Third Party enrichment, Ontology, knowledge graph, Word Disambiguation

5. Data Exploration
- Exploratory data analysis to summarize dataset characteristics
- Mining the hidden patterns in data
- Statistical analysis concluding right analytical model

6. Data Analytics
- Build rule based analytics model meeting requirements
- Build a salable machine learning model for large data
- Build a deep learning model for complex unstructured data
- Models based on algorithms including hyper-parameter optimization for regression, classification, clustering, association, neural networks, and deep learning - Validating and Testing the data model prepared

7. Data Communication
- Analyzing results via Visualization, D3.JS or Tableau
- Providing actionable insights through automation

SKILLS
1. Framework: Apache Hadoop (MapReduce, YARN, HDFS)
2. Distributed Programming: Apache Spark
- Parallel processing of large scale data, batch or streaming
- In-memory computation of big data in seconds or less
- Machine learning implementation - Graph data analysis
- Real-time streaming - Processing in Python, Scala, and R
3. Machine Learning: Python, PySpark, TensorFlow
4. Key Map Data Mode: Apache HBase, Apache Cassandra
5. Key Value Data: Reddis
6. Graph Database: Apache Giraph, Neo4J
7. Document Data Model: MongoDB
8. Data Ingestion: Apache Flume, Apache Sqoop
9. SQL-like Processing: Apache Hive, Apache Impala
10. Message Oriented Middleware: Apache Kafka, RabbitMQ
11. Service Programming: Apache Avro, Apache Zookeer
12. Scheduling: Apache Oozie
13. Applications: Apache Nutch, Apache Tika
14. Search Engine Framework: Apache Solr, Apache Lucene
15. Data Visualization: D3.JS, Tableau, Qlikview, MicroStrategy
16. Distribution: Cloudera, HortonWorks, MapR, Pivotal
17. Deployment: Amazon AWS, Microsoft Azure, Google Cloud, IBM Bluemix
18. Web: Flask, Django, Angular.JS, Ember.JS, Node.JS

SPECIALITY
1. Data Handling data from marketing automation, CRM, POS, click-through, social media, machines, third party
2. Analytics
- Customer
- Finance
- Supply
- Inventory
- HR
- IoT
3. Application Data Lake, DMP, Dashboards, Apps

Feedback
I worked with 75+ global clients for last 12 years and received an overall 8.75/10 rating (weighted average)

Communication
Possess excellent English skills. Use Skype to communicate with my clients.

Availability
Available from 9:30 AM (IST) to 1:00 AM (IST) for my clients from different time zones.

Contact
Please send your queries to pranav.verma.info[at]gmail.com or Skype at vpranav121

Project history

Pranav had been associated with esteemed businesses including Invitation Homes, Kaiser Permanente, Zurich Financial Services, Hughes Depot Supply, Boeing, Nielsen, Adap TV, Ecolab, CNA Financial, to name very few in list.

Local Availability

Only available in these countries: India

Available from 9:30 AM (IST) to 1:00 AM (IST) for my clients from different time zones.

I can travel.

Other

Please contact me for work on:

1. Business Intelligence
2. Data Warehousing
3. Data Science
4. Machine Learning
5. Deep Learning
6. Reinforcement Learning
7. Advanced Analytics
8. Statistical Inference
9. Big Data
10. Apache Spark