Lead Cyber Security Evaluation Expert

Scale AI, Inc.
$180,000 - $200,000 USD
United States, D.C., Washington
Aug 13, 2025

Scale is at the frontier of the AI industry, improving the world's leading Generative AI and Large Language Models through model evaluations, human-powered supervised fine-tuning (SFT) datasets, world-class Reinforcement Learning with Human Feedback (RLHF), and more.

We are seeking a deeply experienced and cross-functional Lead Cybersecurity Evaluation Expert to advise and oversee the technical quality and strategic scope of cutting-edge Cyber Test & Evaluation (T&E) projects assessing Large Language Models (LLMs). This internal expert will serve as the lead advisor across multiple cyber domains, guiding dataset development efforts, validating expert contributions from subcontractors, and ensuring that benchmarks reflect real-world complexity, domain authenticity, and technical rigor.

The ideal candidate will possess deep hands-on knowledge across multiple cybersecurity domains (such as network exploitation, cryptographic systems, LLM adversarial testing, APT analysis, and cyber ethics) and have prior experience in red teaming, incident response, or threat intelligence. This role is pivotal to ensuring that all T&E artifacts generated by subcontracted experts meet the highest standards of realism, fidelity, and relevance.

Key Responsibilities



  • Domain oversight: Provide strategic oversight across all cyber subdomains including but not limited to malicious network traffic, cryptographic systems, adversarial LLM prompts, threat intelligence, and cyber ethics.
  • Scoping & strategy: Collaborate with the Program Manager to define project goals, deliverable scopes, evaluation frameworks, and technical benchmarks.
  • Expert vetting: Assess the technical credibility of cyber experts proposed by subcontractors; conduct interviews and review technical artifacts to validate expertise.
  • Quality control: Review and validate the accuracy, depth, and applicability of all datasets and question-answer pairs produced by subcontracted experts.
  • Standardization: Establish and enforce evaluation rubrics, scenario fidelity criteria, and documentation standards to ensure consistency across all workstreams.
  • Cross-domain bridging: Identify cross-domain gaps, propose integrated benchmark scenarios, and ensure logical alignment between adjacent domains (e.g., how network behavior supports APT identification).
  • Stakeholder communication: Provide subject-matter advice to internal and external stakeholders on technical feasibility, risks, and coverage completeness.



Required Skills



  • 8+ years of hands-on experience in cybersecurity, with demonstrated proficiency across multiple domains (e.g., red teaming, cryptography, network forensics, cyber threat intelligence, adversarial ML).
  • Proven experience in one or more of the following: red-teaming LLMs, TTP identification using MITRE ATT&CK, cryptographic protocol evaluation, or creation of high-fidelity cyber scenarios.
  • Familiarity with cybersecurity testing methodologies (e.g., penetration testing, adversarial simulation, red team exercises).
  • Strong analytical, evaluative, and problem-solving abilities.
  • Excellent communication skills with a strong technical writing background.



Preferred Qualifications



  • Prior experience leading or advising multi-expert technical teams across multiple cybersecurity disciplines.
  • Understanding of LLM architectures and AI model evaluation processes.
  • Familiarity with T&E in government or defense settings (e.g., AFWERX, MITRE, DoD AI efforts).
  • Certifications such as CISSP, OSCP, GCIH, GCIA, GPEN, or equivalent.

Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Directors approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for an equity grant. You'll also receive benefits including, but not limited to, comprehensive health, dental, and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.

The base salary range for this full-time position in the location of Washington DC is:
$180,000 - $200,000 USD

PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.

About Us:

At Scale, we believe that the transition from traditional software to AI is one of the most important shifts of our time. Our mission is to make that happen faster across every industry, and our team is transforming how organizations build and deploy AI. Our products power the world's most advanced LLMs, generative models, and computer vision models. We are trusted by generative AI companies such as OpenAI, Meta, and Microsoft, government agencies like the U.S. Army and U.S. Air Force, and enterprises including GM and Accenture. We are expanding our team to accelerate the development of AI applications.

We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status.

We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at accommodations@scale.com. Please see the United States Department of Labor's Know Your Rights poster for additional information.

We comply with the United States Department of Labor's Pay Transparency provision.

PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants' needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
