TECHNICAL SKILLS
Languages/Frameworks: XML, HTML, CSS, PHP, Python, Java, SQL, Flask, Django.
Technical Environment/Tools: Hadoop big data ecosystems (HDFS, HBase, Hive, Sqoop, Spark, Talend), Dbeaver, Jupyter Notebook, Visual Studio Code, Eclipse, Git, CentOS, Ubuntu.
PROFESSIONAL EXPERIENCE
Software Engineer,May’19 – Present
- Maintain and improve the performance of existing software.
- Collaborate with developers and perform code reviews as needed.
- Develop custom real-time streaming data pipelines working within the MapReduce ecosystem using Spark streaming
- Ensure proper data governance policies are followed by implementing or validating data lineage, quality checks, classification, etc.
- Developed Spark scripts by using python as per the requirement.
- Resolve defects/bugs during QA testing, pre-production, production, and post-release patches
- Operate within an Agile Development environment and apply the methodologies
- Create design and participate in design and code reviews
- Contribute to the design and architecture of the project
- Monitor our production workloads, evaluate performance issues and solve them.
- Automate and streamline our processes to enable the team’s deliverables to our customers
- Build and maintain tools for deployment, monitoring and operations.
Software Engineer,Mar’18 – April ‘19
- Maintain and improve the performance of existing software.
- Collaborate with developers and perform code reviews as needed.
- Used Jira for project tracking, Bug tracking and Project Management.
- Designing resilient, well-controlled, self-monitored, scalable, and high performing ETL processes using Talend.
- Plan, coordinate, develop and support ETL processes including architecting table structure, building ETL process, documentation, and long-term preparedness.
- Developed Python scripts using both Data frames/SQL and RDD in Spark for Data Aggregation, queries and writing data back into RDBMS through Sqoop.
- Exported the analyzed data to the relational databases using sqoop for visualization and report generation for R&D team
- Designing and building production data pipelines from ingestion to consumption.
- Imported data from different sources like HDFS/Hbase to Spark RDD
- Loaded and extracted the data using Sqoop from different sources into HDFS and
- Used Spark over Cloudera Hadoop to perform analytics on data in Hive.
- Developed Spark scripts by using python as per the requirement.
Application EngineerMar’17 – Mar ‘18
- Involved in various phases of Software Development Life Cycle (SDLC) such as requirement gathering, modeling, analysis, design and development.
- User requirements study, analysis and review of the specifications.
- Used JIRA for bug tracking and issue tracking.
- Extensively worked on Python scripting and development. CSS is used to style Web pages, XHTML and XML markup.
- Used Flask framework to develop entire frontend and backend modules in Python.
- Utilize PyUnit, the Python unit test framework, for all Python applications.
- Involved in Unit testing and Integration testing.
- Develop Performed database operations and queries using MySQL.
- Exposure to OLAP, OLTP, Data warehouse, Data mart development, Fact and Dimensional Db. Designs.
- Tuning T-SQL queries (DDL, DML and DCL) to improve the database performance and availability
- Extracting data from a staging area, transferring of the data to the SQL Server data warehouse, performing all associated transformations, validations, cleansing and preparing data to load into the data warehouse.
- Performed troubleshoot to identify software performance issues, document software bugs.
- Created, updated and maintained process flow maps, training documentation, communication plans.
- Investigated customer complaints, conducted root cause analysis and performed corrective actions
- Configure automated processes for data cleansing, data validation, data fields tracking and end-end large volume print processing.
Application Analyst Intern May’16 – Aug’16
• Used JIRA for bug tracking and issue tracking.
• Responsible for debugging and troubleshooting the web application.
• Used Design patterns efficiently to improve the code reusability.
• Writing the typical SQL queries using different joins, sub queries and nested query in SQL query.
• Created Database Objects like tables, Views, Stored Procedures, functions, and Triggers.
• Perform DML, DDL Operations as per the Business requirement.
• Worked with developers to correct issues with current applications.
• Developed and maintained MS access database for departmental use.
• Participates in design reviews, test case reviews and production support readiness reviews for new releases and provide inputs for Go / NO Go decision
• Contribute to consistency of internal documentation related to issues, and resolutions.
EDUCATION
Bachelor of Science: Computer Science Aug’18
University
CERTIFICATION
AWS Certified Big Data – Specialty In Progress
The post Maintain and improve the performance appeared first on My Assignment Online.