Plano, Texas, United States
Overall 1+ years of professional experience as a Data Developer using the Apache Spark framework and as an Oracle Database Administrator
Hands-on experience installing, configuring, and using Hadoop ecosystem components such as Apache Spark, HDFS, SQL, Hadoop, Zookeeper, and Kafka,
as well as full-stack experience
Hands-on experience with Java, REST APIs, and Spring Boot
Familiar with machine learning and linear algebra, with academic knowledge of deep learning and machine learning libraries such as TensorFlow
Strong knowledge of Software Development Life Cycle and expertise in detailed design documentation
Good knowledge of writing business analytics scripts in SQL.
October 2019 - Present
Involved in requirements gathering in coordination with business analysis.
Worked with and learned a great deal from Amazon Web Services (AWS) cloud services such as S3 buckets for storage.
Worked with the business functional lead to review and finalize requirements and data profiling analysis.
Wrote SQL queries for data analysis to meet business requirements and performed validations on the data.
Created data visualizations and generated reports to present results clearly.
Loaded and transformed large sets of structured, semi-structured, and unstructured data in various formats such as text and JSON.
Implemented data integrity and data quality checks using the Nebula repository and created workarounds for various data loading techniques in Test
Worked on a Java project with Spring Boot and REST APIs, and used the Maven build tool to extend the build process.
Worked on Kafka to modify and test some tables. Integrated Kafka with Spark Streaming APIs to process data and save it to HDFS.
Freelance Software Developer
The SIdeWalk Cafe
September 2016 - March 2017, Nagpur, MH, India
Used Java to improve and optimize the performance of the existing analytical system in Hadoop using Spark and YARN.
Implemented complex JDBC programs to perform joins across different tables
Created a user-friendly GUI so that clients could perform their tasks.
Involved in developing ThoughtSpot reports and automated workflows to load data.
Participated in meetings with clients (internal and external) and assisted in framing projects and designing solutions based on client needs and the problems to be solved
Provided training on how to use the software and how to troubleshoot any problems that occurred
Time series data analysis
Designed and implemented an ensemble learning algorithm to predict sales during given dates.
The design and implementation are based on a recurrent neural network (LSTM, to be specific) and
Customer Analysis system for a bank
Implemented and assisted in the development of 3 core functionalities: risk assessment, fraud detection, and data visualization
Led a team of 3 developers in developing the fraud detection and visualization submodules
Assessed all the core functionality on the basis of the SDLC model
The whole project was implemented in NetBeans and Eclipse
Java, Spark, and SQL were the core of the whole project
Master of Science in Computer Science
Binghamton University - SUNY • Binghamton, NY • 2019
Bachelor of Engineering in Information Technology
K J Somaiya College of Engineering • Mumbai, MH, India • 2016
Big Data: HDFS, Apache Spark, Spark Streaming, Zookeeper, Hadoop, Kafka, YARN, MongoDB, TensorFlow, DynamoDB
Languages: Java, Scala, Python, Kotlin, C, C++, SQL/PL/SQL, Shell Scripting.
Database: MySQL, MongoDB, Cassandra, Oracle 10g/11g, Microsoft SQL Server 2014
IDE / Testing Tools: Eclipse, IntelliJ IDEA, JMeter, JUnit, Jupyter Notebook
Tools: PostgreSQL, Maven, Snowflake, Jira, Spring Boot and REST API, Windows, UNIX, Linux, macOS