
Data Engineer
Arkhya Tech
Contract
Required Skillset:
SparkSQL
Job Description
Real-time data pipelines using Spark Structured Streaming, MapReduce, and other Big Data frameworks.
Ingest data from multiple sources such as message queues (Kafka), file shares, REST APIs, and relational databases.
Transform, clean, and validate data in HDFS, Hive, Impala, or Spark SQL.
Convert and manage data in formats such as JSON, CSV, and XML to support downstream analytics.
Perform data validation, profiling, and analysis to identify anomalies and ensure data integrity.
Troubleshoot issues in data pipelines, SQL jobs, or Spark applications, including slow-running jobs or failures.