Architect, Cloudera Spark – Big Data Developer, Spark

Remote Full-time
Job Description: • Operate on high-complexity data engineering and data analytics projects • Ensure scalability and security of data architectures in enterprise environments • Develop and optimize distributed data pipelines in enterprise contexts Requirements: • Big Data Architect with deep experience in distributed environments and Cloudera technologies • Proven experience with Big Data architectures based on Cloudera Data Platform (CDP) and Apache Spark • Strong knowledge of HDFS, Hive, Impala, HBase, Kafka and NiFi • Proficiency with YARN, Ranger, Knox, Atlas and data security and governance tools • Experience in data modeling and design of ETL/ELT pipelines • Knowledge of Scala, Python and SQL • Good understanding of microservices, containerization (Docker, Kubernetes) and REST APIs • Familiarity with Linux/Unix environments and advanced scripting • Experience with monitoring tools and performance tuning for Spark and Cloudera • Experience in Public Administration or regulated environments is a plus • Big Data Developer with solid experience in Cloudera and Apache Spark environments • At least 3 years' experience developing applications on Apache Spark (Core, SQL, Streaming) • Deep knowledge of the Cloudera ecosystem (HDFS, Hive, Impala, Oozie, NiFi) • Strong proficiency in Scala and Python • Experience managing and optimizing Spark jobs in clustered environments • Knowledge of Kafka for real-time ingestion • Familiarity with Git, Jenkins, arenaflex/CD and DevOps best practices • Experience in query tuning, data ingestion pipelines and data transformation • Basic knowledge of Linux, shell scripting and distributed systems • Attention to detail and ability to work in structured environments • Good communication skills and a team-oriented attitude • Commitment to continuous improvement and adoption of quality standards Benefits: • Remote work Apply tot his job Apply tot his job
Apply Now →

Similar Jobs

Lead Big Data Engineer - PySpark

Remote

[Remote] Bilingual Customer Service Representative-SDU-Work From Home-TX ONLY

Remote

[Hiring] EHR Application II Analyst @BJC HealthCare

Remote

Blockchain Data Wizard, Analyst or Scientist

Remote

VDC/BIM Manager - HVAC - Remote Option

Remote

Senior/Lead Bioinformatics Scientist (Development)

Remote

Bioinformatics Developer 6314 Remote/Teleworker US

Remote

Native Japanese Chat Support Consultant, crypto; Remote

Remote

Tech Lead in Blockchain Consulting

Remote

Principal, Board Governance Advisor (Legal Counsel Support) – Hoag Hospital – Newport Beach, CA

Remote

**Experienced Remote Customer Service Representative – Deliver Exceptional Blithequark Customer Experience**

Remote

Entry-Level Data Entry Clerk for Remote Full-Time Position with Opportunities for Growth and Professional Development at blithequark

Remote

Content Assessor

Remote

**Experienced Full Stack Data Entry Specialist – E-commerce Operations at arenaflex**

Remote

**Experienced Customer Service Representative – Remote Online Chat Specialist**

Remote

Territory Sales Rep – Liquid Manufacturing - (Remote – Ohio)

Remote

Experienced Thesis Expert Tutor in Metallurgy and Mechanical Properties of Structural Steel – Remote Part-Time Opportunity for Academic Support and Guidance

Remote

Senior Associate, Provisions / Corporate Tax (R...

Remote

Talent Mobility Analyst – Driving Global Mobility Solutions and Exceptional Relocation Experiences at Toyota

Remote

Case Manager - Pathway Home MacArthur Park - (JR 5025)

Remote
← Back