Welocalize logo
Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist
contractCairo

Summary

Location

Cairo

Type

contract

Explore Jobs

About this role

Overview

We are seeking Arabic (Levantine) AI Evaluation Specialists to help assess and improve the performance of advanced AI systems. In this role, you’ll contribute directly to the evaluation and enhancement of large language models (LLMs) by testing how they understand, generate, and respond to Arabic content.

You will craft realistic scenarios, analyze model outputs for quality and safety, and help ensure the technology delivers accurate, culturally appropriate, and reliable results. Your insights will play a key role in shaping smarter AI experiences. 


Project Details 

Language: Native fluency in Levantin Arabic

Location: Remote-Egypt 

Project Duration: 3 months 

Pay Rate: $10 USD/Hour  

Schedule: 40 hours a week. 8 hours per day Mon-Fri 

Start Date: February 2nd


What You Will Do

- Conduct side-by-side comparisons of AI responses and rate their quality on a 1–5 scale based on established guidelines.

- Design scenario-based and edge-case prompts to evaluate model behavior, including tricky, ambiguous, or incomplete information situations.

- Assess outputs for instruction adherence, factual accuracy, tone, safety, and overall usefulness.

- Develop clear evaluation rubrics and criteria to ensure consistent scoring across tasks.

- Create reliable reference materials (articles, transcripts, reports, etc.) to serve as the source of truth for testing.

- Write well-structured “gold standard” responses that demonstrate the most accurate and helpful answer.

- Identify potential issues such as hallucinations, inconsistencies, or cultural/contextual mismatches.


Qualifications  

- Bachelor's degree or equivalent experience in Linguistics, Computational Linguistics, Communications, Technical Writing, or a related analytical field.  

- B2 or superior level of English.  

- Native fluency in Modern Standard Arabic in Levantine dialect. 

-Strong understanding of the distinction between Fusha and ‘Ammiyya 

- Proven experience in a role involving AI data annotation, content quality review, search quality rating, or prompt engineering.  

- Ability to work independently and manage workflows effectively in a remote environment. 


Nice to Have  

- Multilingual proficiency in one or more Arabic dialects.  

- Strong attention to detail and critical thinking to identify hallucinations and bias 

- Familiarity with data annotation platforms and model evaluation tools. 

- Experience in prompt engineering, AI evaluation, linguistic QA, or translation is a plus 

- Cultural familiarity with regional norms and high-context communication styles, particularly in the GCC region. 


Note: Please do not use VPNs or IP-masking tools during the recruitment process — our security system requires accurate regional verification. 


\n


\n

Why Join Welo Data?  

✨ Limitless Flexibility  

Project-based opportunities that fit your availability. Choose when and how much you want to contribute—fully remote, with complete autonomy.   

🌱 Limitless Growth  

Optional access to AI and Large Language Model workshops designed specifically for professionals like you. No coding required—just your expertise.   

🌍 Limitless Support  

Be part of a global contributor community with responsive guidance and support.   

💡 Real Impact  

Apply your expertise in the Legal field to influence the AI systems shaping the future of your industry—while collaborating with data professionals and expanding your skills.  


How to Apply? 

Apply now by answering a few quick questions to join our database and become part of our growing community. 


About Welo Data 

Welo Data, part of Welocalize, is a global AI data company with 500,000+ contributors delivering high-quality, ethical data to train the world’s most advanced AI systems. We’re building smarter, more human AI with a diverse community in 100+ countries.  

At Welo Data, Limitless AI. Limitless You. isn’t just a slogan—it’s our promise. We build smarter AI through the power of human contribution, offering limitless opportunities for our global community to grow, contribute, and work on their terms. 

Other facts

Tech stack
AI Evaluation,Arabic Language,Data Annotation,Content Quality Review,Prompt Engineering,Critical Thinking,Linguistics,Technical Writing,Cultural Familiarity,Scenario Design,Model Evaluation,Instruction Adherence,Factual Accuracy,Tone Assessment,Safety Evaluation,Workflow Management

About Welocalize

For over 25 years, Welocalize has helped some of the world's largest organizations improve customer engagement through the power of localized content. Since our founding in 1997, we've been a leader in applying innovative technology and adopting AI to deliver the highest quality translations quickly, efficiently, and at scale. Our proven track record showcases tangible business outcomes, including higher marketing conversion rates, regulatory compliance, increased customer satisfaction and retention, intellectual property protection, higher adoption rates, and improved data with enhanced models.

At the heart of our innovation is OPAL, our advanced Service Delivery Platform, ensuring every translation is fast, effortless, and impeccably accurate. This technology, combined with our extensive network of over 250,000 linguistic experts in more than 250 languages, allows us to deliver multilingual content transformation services that are unmatched in quality and relevance.

Our global team of industry specialists is dedicated to enabling your teams to achieve global business outcomes. From translation and localization to NLP-enabled machine learning training data, and data annotation, we blend cutting-edge technology with human insight across every project.

Team size: 1,001-5,000 employees
LinkedIn: Visit
Industry: Translation and Localization

What you'll do

  • You will assess and improve the performance of AI systems by evaluating large language models. This includes crafting scenarios, analyzing outputs, and ensuring culturally appropriate results.

Ready to join Welocalize?

Take the next step in your career journey

Frequently Asked Questions

What does a Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist do at Welocalize?

As a Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist at Welocalize, you will: you will assess and improve the performance of AI systems by evaluating large language models. This includes crafting scenarios, analyzing outputs, and ensuring culturally appropriate results..

Why join Welocalize as a Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist?

Welocalize is a leading Translation and Localization company.

Is the Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist position at Welocalize remote?

The Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist position at Welocalize is based in Cairo, Cairo, Egypt. Contact the company through Clera for specific work arrangement details.

How do I apply for the Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist position at Welocalize?

You can apply for the Alpha Pictoris | Arabic (Levantine) AI Evaluation Specialist position at Welocalize directly through Clera. Click the "Apply Now" button above to start your application. Clera's AI-powered platform will help match your profile with this opportunity and guide you through the application process. You can also learn more about Welocalize on their website.