S Melugiri

  • Hadoop Developer
  • Houston, TX
  • Member Since Feb 22, 2023

Candidates About

 

S Melugiri               

 

IT Experience: 8+ years of experience in J2EE (6+ years), Scala (2+ years), iOS (2 years), Hadoop (6+ years) and Spark (2+ years)

 

KEY SKILLS

Languages: Java 1.2 – 1.8, Scala 2.11.8, Python, HTML, JavaScript, C, C++, Objective C, PL/SQL, XML

OS/Environments: UNIX, LINUX, Windows, iOS, HDP 1.3, CDH 4.x, HDP 2.x

Web: Struts, Hibernate, JSON, XML, Jquery, JAX-WS, REST

iOS: Cocoa touch, UIKit, iOS 4, 5 and 6

Servers: Weblogic 10, Tomcat, JRun, apache HTTP Server

Database: Oracle 11, Teradata 13, SQLServer, MySQL, SQLite, Hbase (NoSql), Accumulo(NoSql), Hive

Hadoop ecosystem: HDFS 2.x, MapReduce 2, Hbase, Flume-NG, Sqoop, Pig, Hive, HCatlog, Accumulo 1.5, Oozie, Spark 2.x

NoSQL: Solr/Lucene 4.7 (Document DB), Memcached (KeyValue in-memory store), Sqrrl 1.x (Document DB)

Tools/IDE: Eclipse, Intellij, Xcode, Starteam, SVN, Git, Confluence, JIRA, Sublime, Atom, MVN, SBT

Cloud Technologies: Acquiring the Developer & Architect level of knowledge on AWS by pursuing classroom & virtual sessions. (migrating the current solutions from on-prem to AWS cloud)

 

Current/Recent Project: (May 2014 to May 2018)(Client largest modern media company)

Technology: Spark 2.x, Scala, Pig 0.13 - 0.16, Hive, JSON, AVRO, ORC, Parquet, HDP 2.x, Java REST API, banana, Solr, HCatalog, SBT, git

Lift analysis/attribution based on TV, Browsing and Location (store visitation) dataset

Content (analysis) microscope

Multi touch (Cross screen) analysis & insights: Preparing the datasets for data science modelling by mixing with 1st party datasets with 3rd party data (mobile ad impressions) to measure ad effectiveness.

Video-logy (Media wall): In production: GUI application backed with Spark processing engine to show case the advertising capabilities to customers (Exchange & Advertisers).

Role & Responsibilities: Big Data Software Engineer

  • Data wrangling and sanity checks of the verity of data based on the use case requirements.
  • Data cleansing and enhancements using Pig, Hive and Spark engines
  • Participating with business stakeholders to understand the requirements and preparing the technical specification documents
  • Preparing the documents to get the buying/approvals from compliance and legal team.
  • Architecting the solution using the cutting-edge technologies like (Spark, Scala, Hadoop stack) and presenting to enterprise architect and data strategy committees
  • Design and development of the datasets/tables as per the data scientist needs. Like processing the Petabytes of curated datasets to produce data structures to perform ML modelling.
  • Scaling the Machine Learning (ML) models using the scalable technologies like Spark ML and H2o based deep learning libraries

 

 

Previous Project (Aug-2012 to Apr-2014): GEAR 2.0 (GSOC Event Activity Reporter), Verizon, TX, USA

Technology: Hadoop 1.x, Hbase 0.94, Memcached, SolrCloud 4.7, Flume 1.4, Sqoop, Hive, Pig, Java 1.6, JQuery, Groovy, Html, JSON, MySQL, SVN, Bamboo, Confluence, JIRA, Eclipse, HDP 1.3                                                                                     

Big Data Solution → Blue coat proxy logs (Structured Data)

Big Data Solution → IronPort logs (Unstructured Data)

Role & Responsibilities: Hadoop Developer.

  • Design and development of MapReduce jobs to process proxy logs for initial bulk ingest.
  • Customization of Flume-NG agents to collect data from various data centers as part of live data streaming and inserting the data into Hbase tables by using Hbase sink.
  • Implemented Serializers to transform the raw data as per business requirement.
  • Developed the middleware services to fetch the data using Hbase Client API, Memcached, SolrCloud and Java.
  • Created the Sqoop scripts to import/export the user centric data summarization from Hbase to RDBMS (MySQL) to support crystal report generation.

 

Previous Project (Feb-2011 To Jul-2012): TOUCH.Ex (Track OUr CHannel Execution), PepsiCo, TX, USA

Technology: Java, JAX-WS, iOS 4 - 5, Cocoa Touch, XML, Teradata 13, Xcode, Eclipse, Weblogic 10, Redhat LINUX, Starteam.

Role & Responsibilities: Onsite Lead/Java expert.

  • Gathering the requirements from business owner and translating them into technical specifications.
  • Designing and developing the service layer with JAXWS to process the request from the iOS application
  • Creating the prototypes/wire frames to finalize the user interface as per business requirements

 

SELECTED ACCOMPLISHMENTS

 

 

Acquired the knowledge of Hadoop Ecosystem to design and develop solutions for legal & security teams. Developed next generation tools (web application) for legal & security management teams to provide information at internet scale.

 

Delivered product with high quality. Recognized and received the best project of the year for 2011 from Cognizant management team for leading the first iOS application for PepsiCo.

 

CAREER HISTORY

 

Big Data Software Engineer, AT&T, May-2014 to May 2018. Developing the big data solutions using Hadoop ecosystem and in memory based Spark engine. Processing Petabytes of data to answer business queries and designing the next generation products for world’s largest Media company.

 

Hadoop Developer, Verizon Aug-2012 to Apr-2014. Developing the big data solutions using Hadoop 1.1.2 on Hortonworks Data Platform (HDP 1.3). Developed MapReduce 2 programs to load the historical data. Also, created jobs to generate summarizations of data on various metrics. Used Flume-NG to transfer and transform the data. Used Hbase and Accumulo column oriented databased to develop interactive web applications.

 

Java/API developer, Cognizant Technological Solutions, USA, Feb-2011 to Jul-2012. Designed and developed first iPad (iOS) mobile application for PepsiCo.