  • Posted: Oct 14, 2022
    Deadline: Not specified
  • Sama is the global leader in ethical data annotation and model evaluation solutions for computer vision, generative AI, and other major applications of artificial intelligence. Our solutions minimize the risk of model failure and lower the total cost of ownership through an enterprise-ready, ML-powered platform, actionable data insights uncovered by proprietary algorithms, and a highly skilled on-staff team of over 5,000 data experts. 25% of Fortune 50 companies, including GM, Ford, Microsoft, and Google, trust Sama to help deliver industry-leading ML models.

    Data Engineer

    About the Job:

    As a Data Engineer at Sama, you will build and optimize data architectures and data pipelines, and you will build and maintain ETL and ELT data flows for cross-functional teams. You will support our data analysts, data scientists, and other stakeholders on data initiatives. Your primary goal is to ensure optimal and consistent data availability, data quality, and data delivery architecture.

    Key Responsibilities: 

    • Create and maintain optimal data pipeline architectures that serve key business stakeholders.
    • Assemble large, complex data sets that meet business requirements for different stakeholders and teams.
    • Build and maintain the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources.
    • Develop and maintain a data catalog of data sets, scripts, tools and pipelines as part of documentation.
    • Work with stakeholders to identify their data needs and provide consistent data availability and quality to meet those needs.
    • Work with business analytics to build ETL pipelines that serve various areas of business.
    • Identify any bottlenecks or challenges in the current data pipelining approaches and suggest areas of improvement.
    • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
    • Build analytics tools that use the data pipeline to provide actionable insights on key business metrics.
    • Maintain a day-to-day relationship with stakeholders to understand their data needs and communicate results clearly.

    Minimum Qualifications:

    • Advanced working knowledge of SQL.
    • Experience with Google Cloud Platform and its services.
    • Experience working with relational and non-relational databases: BigQuery, AWS, PostgreSQL, etc.
    • Working knowledge of data pipelining tools such as Hevo.
    • Experience with transformation tools such as Dataform and dbt (data build tool).
    • Experience with object-oriented or functional scripting languages: Python, JavaScript, Java, C++.
    • Experience working on CI/CD processes and source control tools such as GitHub.
    • A successful history of manipulating, processing and extracting value from large disconnected datasets.

    Preferred Qualifications:

    • Outstanding communication and collaboration skills.
    • Ability to stay self-motivated and work with little or no supervision.
    • Excellent time management and organizational abilities.

    Method of Application

    Interested and qualified? Go to Sama on boards.greenhouse.io to apply
