PwC
Website: pwc.com
Job details:
At PwC, our people in data and analytics engineering focus on leveraging advanced technologies and techniques to design and develop robust data solutions for clients. They play a crucial role in transforming raw data into actionable insights, enabling informed decision-making and driving business growth. In data engineering at PwC, you will focus on designing and building data infrastructure and systems to enable efficient data processing and analysis. You will be responsible for developing and implementing data pipelines, data integration, and data transformation solutions.
Job Description: GenAI Data Engineer - Senior Associate (PwC US AC)
PwC US - Acceleration Center is seeking a highly skilled and experienced GenAI Data Engineer to join our team at Senior Associate level. As a GenAI Data Engineer, you will be responsible for developing and maintaining data pipelines, implementing machine learning models, and optimizing data infrastructure for our GenAI projects. The ideal candidate should have a strong background in data engineering, with a focus on GenAI technologies, and possess a solid understanding of data processing, event-driven architectures, containerization, and cloud computing.
Responsibilities:
- Design, develop, and maintain data pipelines and ETL processes for GenAI projects.
- Collaborate with data scientists and software engineers to implement machine learning models and algorithms.
- Optimize data infrastructure and storage solutions to ensure efficient and scalable data processing.
- Implement event-driven architectures to enable real-time data processing and analysis.
- Utilize containerization technologies like Kubernetes and Docker for efficient deployment and scalability.
- Develop and maintain data lakes for storing and managing large volumes of structured and unstructured data.
- Implement and integrate LLM frameworks (LangChain, Semantic Kernel) for advanced language processing and analysis.
- Collaborate with cross-functional teams to design and implement solution architectures for GenAI projects.
- Utilize cloud computing platforms such as Azure or AWS for data processing, storage, and deployment.
- Monitor and troubleshoot data pipelines and systems to ensure smooth and uninterrupted data flow.
- Stay up-to-date with the latest advancements in GenAI technologies and recommend innovative solutions to enhance data engineering processes.
- Collaborate with cross-functional teams to understand business requirements and translate them into technical solutions.
- Document data engineering processes, methodologies, and best practices.
- Maintain solution architecture certificates and stay current with industry best practices.
Requirements:
- Python Proficiency: Minimum 3 years of hands-on experience building applications with Python.
- Scalable System Design: Solid understanding of designing and architecting scalable Python applications, particularly for Gen AI use cases, including the components and architecture patterns needed to build cohesive, decoupled, and scalable systems.
- Web Frameworks: Familiarity with Python web frameworks (Flask, FastAPI) for building web applications around AI models.
- Modular Design & Security: Demonstrated ability to design applications with modularity, reusability, and security best practices in mind (session management, vulnerability prevention, etc.).
- Cloud-Native Development: Familiarity with cloud-native development patterns and tools (e.g., REST APIs, microservices, serverless functions).
- Cloud Deployments: Experience deploying and managing containerized applications on Azure/AWS (Azure Kubernetes Service, Azure Container Instances, or similar).
- Version Control (Git): Strong proficiency in Git for effective code collaboration and management.
- CI/CD: Knowledge of continuous integration and deployment (CI/CD) practices on cloud platforms.
- 3-5 years of relevant technical/technology experience, with a focus on GenAI projects.
- Strong programming skills in Python.
- Experience with data processing frameworks like Apache Spark or similar.
- Proficiency in SQL and database management systems.
Preferred Skills:
- Gen AI Frameworks: Experience with LLM frameworks or tools for interacting with LLMs, such as LangChain, Semantic Kernel, or LlamaIndex.
- Data Pipelines: Experience in setting up data pipelines for model training and real-time inference.
If you are passionate about GenAI technologies and have a proven track record in data engineering, join PwC US-Acceleration Center and be part of a dynamic team that is shaping the future of GenAI solutions. We offer a collaborative and innovative work environment where you can make a significant impact.
The Opportunity
When you join PwC Acceleration Centers (ACs), you step into a pivotal role focused on actively supporting various Acceleration Center services, from Advisory to Assurance, Tax and Business Services. In our innovative hubs, you’ll engage in challenging projects and provide distinctive services to support client engagements through enhanced quality and innovation. You’ll also participate in dynamic and digitally enabled training that is designed to grow your technical and professional skills.
As part of the GenAI Data Engineering team, you will design and maintain data pipelines for innovative projects. As a Senior Associate, you will collaborate with data scientists to implement advanced machine learning models, enhance data infrastructure, and stay at the forefront of GenAI technologies, ensuring your contributions drive significant business value.
Responsibilities
- Design and maintain data pipelines for innovative projects
- Collaborate with data scientists on machine learning implementations
- Enhance data infrastructure to improve performance
- Stay updated on advancements in GenAI technologies
- Confirm contributions align with business objectives
- Analyze data to derive meaningful insights
- Support the development of data-driven strategies
- Communicate effectively with stakeholders regarding project outcomes
What You Must Have
- Bachelor's Degree
- 3-5 years of experience
- Oral and written proficiency in English required
What Sets You Apart
- Bachelor's Degree in Engineering, Computer Science, or related field
- BE / B.Tech / MCA / M.Sc / M.E / M.Tech / MBA
- Familiarity with Gen AI frameworks and tools
- Experience in setting up data pipelines for model training
- Proficiency in SQL and database management systems
- Knowledge of continuous integration and deployment practices
- Understanding of cloud-native development patterns
- Demonstrated ability in modular design and security practices
- Experience deploying containerized applications on cloud platforms