Job Description

Key Responsibilities

Design, develop, and maintain scalable data architectures for structured and unstructured data including text, images, audio, and video.
Build and optimize enterprise ETL/ELT pipelines using Python, SQL, Spark/PySpark, and Databricks.
Integrate and process data from enterprise platforms such as SAP, Oracle, Azure Data Lake, and other cloud/on-prem systems.
Develop high-performance data pipelines to support AI/ML, computer vision, predictive analytics, and Generative AI use cases.
Implement large-scale image and video preprocessing workflows for AI-driven applications.
Work with feature stores, vector databases, embeddings, and LLM-based data workflows.
Ensure data quality, governance, lineage tracking, metadata management, and security compliance across platforms.
Collaborate with AI engineers, data scientists, and cross-functional teams to deliver production-ready data solutions.
Optimize data processing performance, scalability, and reliability in hybrid cloud en...

Apply for this Position

Ready to join Actualize? Click the button below to submit your application.
