Thank You!

To all the sponsors, speakers and delegates for making Pune Data Conference a massive success.

Did you miss it? No worries, the presentations and videos are coming soon. Stay tuned...

Big Data Event in Pune

HOSTED BY CLAIRVOYANT

KEY HIGHLIGHTS PDC 2019

Keynote Address
Hackathon
Panel Q&A
Industry Leading Speakers
Raffle and Takeaways
conference photo

WHAT IS PUNE DATA CONFERENCE?

The Pune Data Conference brings together the Big Data Analytics community in Pune for a day-long event with multiple sessions by various leading tech leaders, on different topics such as Machine Learning, Artificial Intelligence, IoT, Hadoop Administration and many more. It is organized over the last 5 years in Phoenix, USA and the past 2 years in Pune, India. LinkedIn, Paypal, Wells Fargo, Red Hat, Hortonworks, Ellicium, Parkar Consulting, and Snappy Data were some of the companies that participated in the last two editions of the conference.

0 +

Delegates

0 +

Companies

3

Parallel Sessions

0 +

Sessions

0 +

Speakers

Schedule

8:30 AM - 9:30 AM

Registration

9:30 AM - 10:00 AM

Keynote Speech: Vinod Ganesan, Cloudera

Topic: Building a modern data warehouse in the era of cloud

10:00 AM - 10:30 AM

Tea Break

Inspire 1

10.30 AM - 11.20 AM

Productionalizing Spark Streaming Applications

Robert Sanders

11.30 AM - 12.20 PM

Open Standards for Big Data and Artificial Intelligence

Himanshu Vaidya

Inspire 2

10.30 AM - 11.20 AM

Making Data Science Work

Dr. Kuldeep Deshpande and Saumitra Modak

11.30 AM - 12.20 PM

Leveraging AI & ML in the Telecommunication world

Hemanth Meruga

Inspire 3

10.30 AM - 11.20 AM

Algorithmic Invention Redefining the Next Wave of AI

Pradeepta Mishra

11.30 AM - 12.20 PM

ClaimsAI - Deep Learning in Insurance Claims Processing and lessons from the trenches

Ankit Yadav

12:20 PM - 1:00 PM

Lunch

1.00 PM - 4.00 PM

AWS Workshop: Durga Gadiraju

Overview of AWS Analytics and Hands On Spark Workshop on EMR

Room: Renew

Inspire 1

1.00 PM - 1.50 PM

Onyx- Distributed Computation for the Cloud

Abhishek Amralkar

2.00 PM - 2.50 PM

Large Scale Data Processing | Batch v/s Real-Time

Shrikant Patwari and Ankit Hegde

3.00 PM - 3.50 PM

Role of Big Data in Industry 4.0

Vikas Dhawan

4.00 PM - 4.50 PM

Real-time data movement using Reactive Microservices, Kafka and NiFi

Aman Mittal and Poonam Mishra

Inspire 2

1.00 PM - 1.50 PM

Role of Big Data in Reporting Analytics

Sanjeev Hemnani

2.00 PM - 2.50 PM

Demystifying Machine Learning with Cloudera Modern Platform

Piyush Agarwal

3.00 PM - 3.50 PM

Tuning the Beloved DB-Engines

Rohit Pathak

4.00 PM - 4.50 PM

Taming the Elephant: Hadoop/Spark Auto Tuning

Manoj Kumar

Inspire 3

1.00 PM - 1.50 PM

Admins: Smoke Test Your Hadoop Cluster!

Michael Arnold

2.00 PM - 2.50 PM

Docker On YARN

Adnan Shaikh

3.00 PM - 3.50 PM

Advanced Analytics using Machine Learning and Blockchain.

Vikram Chaudhari and Manish Kumar

4.00 PM - 4.50 PM

Serverless Data Processing on Kubernetes

Tarun Rathore

4:50 PM - 5:15 PM

Break

5:15 PM - 5:45 PM

BFSI Panel Discussion

5.45 PM - 6:00 PM

Hackathon Winners Felicitation, Raffle and Thank You Note

AWS Workshop

Overview of AWS Analytics and Hands On Spark Workshop on EMR

Speaker: Durga Gadiraju

This is a 3 hours hands-on workshop designed to give you the skills and knowledge on how to run Spark jobs on EMR cluster. In this instructor-led session, participants will spin up an EMR cluster, write a spark job for a simple use-case and run on the cluster. Participants will also focus on performance optimization of Spark on EMR.

Speakers

Building a modern data warehouse in the era of cloud

Presentation

Algorithmic Invention Redefining the Next Wave of AI

Presentation

Productionalizing Spark Streaming Applications

Presentation

Taming the Elephant : Hadoop/Spark Auto Tuning

Presentation

Admins: Smoke Test Your Hadoop Cluster!

Presentation

Advanced Analytics using Machine Learning and Blockchain.

Presentation

Advanced Analytics using Machine Learning and Blockchain.

Presentation

Onyx- Distributed Computation for the Cloud

Presentation

Leveraging AI & ML in the Telecommunication world

Adnan Shaikh

Cloudera

Docker On YARN

Presentation

Demystifying Machine Learning with Cloudera Modern Platform

Presentation

Overview of AWS Analytics and Hands On Spark Workshop on EMR

Open Standards for Big Data and Artificial Intelligence

Presentation

Role of Big Data in Reporting Analytics

Presentation

Serverless Data Processing on Kubernetes

Presentation

Large Scale Data Processing | Batch v/s Real-Time

Presentation

Large Scale Data Processing | Batch v/s Real-Time

Presentation

ClaimsAI - Deep Learning in Insurance Claims Processing and lessons from the trenches

Real-time data movement using Reactive Microservices, Kafka and NiFi

Presentation

Real-time data movement using Reactive Microservices, Kafka and NiFi

Presentation

Role of Big Data in Industry 4.0

Presentation
PDC Hackathon

PDC Hackathon 2019

The participants received a list of petitions filed in past; around 30k records split 50-50% between train and validation data sets. The goal was to classify petitions into the right categories and predict the success of a petition.

Winning Team

Sammit Ranade and Mihir Pargaonkar

Runner Up Team

Varun Chaudhary, Antriksh Goel and Koushik Kulkarni

Companies  Participating

Topics

Data Science

Data Engineering

IOT

Internet of Things (IoT)

Containerisation

Containerisation

DevOps

DevOps and Data
Managed Services

Artificial Intelligence

AI and Machine Learning

Cloud Services

Cloud Services

Data Security

Data Security
and Governance

Hadoop Administration

Hadoop Administration

Customer Case study

Customer Case Studies

Testimonials

Do you have any query?

Thank you for reaching us! We will get back to you soon.