JAMES NDUNGU
AI Engineer (LLM Systems) | AI Operations Specialist | Data & Automation Engineer
Remote Operations • AI Model Evaluation • Data Pipelines • CRM & Workflow Automation • Analytics
PROFESSIONAL SUMMARY
AI Operations Specialist with a Computer Science background and hands-on experience across AI training pipelines, LLM evaluation, CRM automation, and data analytics. Delivered 97% QA accuracy across 3,500+ tasks with a focus on data validation, error detection, and system optimization. Proficient in RLHF evaluation, adversarial prompt testing, inter-annotator agreement, and structured data pipelines. Skilled at leveraging Python, SQL, and BI tools to deliver actionable insights, and experienced in HubSpot CRM automation, SLA compliance, and workflow optimization across remote operations.
Additionally experienced in customer support, virtual assistance, CRM management, lead generation, appointment setting, and accurate data entry, including workflow automation, pipeline management, and reporting for remote teams.
CORE SKILLS
AI & LLM Evaluation
RLHF Evaluation • LLM Response Ranking & Scoring • Prompt Engineering • Adversarial Testing • Hallucination Detection • Bias Identification • Inter-Annotator Agreement • Red-Teaming • Safety & Content Policy Evaluation
Customer Support
HubSpot CRM workflows, SLA compliance • Zendesk & Omnichat experience • Remote operations coordination • Chat, Email, Ticketing Systems (Zendesk, Omnichat) • SLA Management & Remote Coordination
Virtual Assistance
Workflow automation, data pipelines, reporting • HubSpot / Zapier / Make for scheduling, reminders, and pipelines • Financial operations (QuickBooks, AP/AR) • Workflow Automation, Reporting & Scheduling • Automated outreach via Omnichat & email campaigns
Lead Generation & Appointment Setting
HubSpot lead workflows & sequences • Omnichat integration for automated outreach • Email automation for engagement • High-volume, accurate data entry for CRM and AI datasets • SQL / Excel / Google Sheets, ETL pipelines
Data Entry & Validation
SQL & Excel / Google Sheets for structured datasets • ETL pipelines for data cleaning and transformation • Annotation QA & structured data validation • High-volume, accurate data entry for CRM and AI datasets • SQL / Excel / Google Sheets, ETL pipelines
AI Engineering & LLM Systems
LLM Integration & APIs • Retrieval-Augmented Generation (RAG) • Prompt Engineering • Multi-Agent Systems (LangChain, LangGraph – foundational)
Data Annotation
NLP Annotation (Text Classification, NER, Dialogue Scoring) • LiDAR / Point Cloud Annotation • Image Annotation (Bounding Boxes, Segmentation) • Rubric-Based Evaluation • Taxonomy Mapping • QA Feedback Loops
Data Engineering & Analytics
SQL (Queries, Aggregations, JOINs) • Python (pandas, NumPy, Matplotlib) • ETL Pipelines • Data Cleaning & Transformation • Statistical Analysis • A/B Testing • Dashboard Development • Google Sheets (Advanced Functions, QUERY, VLOOKUP, Pivot Tables)
BI & Visualization
Google Looker Studio • Tableau (Fundamentals) • Power BI (Fundamentals) • KPI / SLA Monitoring • Reporting Automation
CRM & Automation
HubSpot CRM (Workflows, Lead Scoring, Sequences, Reporting) • Zendesk • Omnichat • Zapier • Make (Integromat) • Lead Segmentation • Email Automation • Pipeline Management
Systems & Operations
Workflow Automation • Process Optimization • Remote Team Coordination • SLA Compliance • Audit Support • Git (Version Control — Fundamentals)
Financial Systems
QuickBooks Online • NetSuite • AP/AR Management • Cash Flow Tracking • Reconciliation • Compliance Reporting
Soft Skills
Analytical Thinking • Attention to Detail • Critical Reasoning • Remote Collaboration • Data Storytelling • Time Management • Adaptability
PROFESSIONAL EXPERIENCE
QA Reviewer — AI Data Operations2022 – 2024
Remotasks | Remote
• Maintained 97% accuracy across 3,500+ AI training tasks spanning NLP text annotation and LiDAR point cloud annotation
• Evaluated and ranked LLM outputs using RLHF frameworks, applying rubric-based scoring on reasoning quality, factual accuracy, and helpfulness
• Conducted adversarial and red-team prompt testing to identify hallucinations, bias, toxicity, and logical inconsistencies in model responses
• Applied inter-annotator agreement principles and calibration processes to ensure consistent, high-quality dataset outputs
• Performed NER, text classification, dialogue scoring, and content safety labeling across structured annotation pipelines
• Improved dataset quality through root-cause analysis and systematic QA feedback loops, contributing directly to model improvement cycles
• Managed high-volume annotation workloads consistently within strict SLA timelines, earning promotion to QA Reviewer
• Performed high-volume, accurate data entry and validation to maintain AI training datasets.
AI Data Operations & CRM Support Specialist2025
Crezivanta | Remote
• Designed and automated lead management workflows using HubSpot CRM (sequences, workflows, lead scoring) and Omnichat, reducing manual processing time significantly
• Reduced CRM data errors by 40% by implementing structured data validation systems and pipeline hygiene protocols
• Maintained 100% SLA compliance across communication pipelines, monitoring performance through HubSpot reporting dashboards
• Configured lead segmentation, automated follow-up sequences, and email automation to improve conversion and engagement rates
• Analysed campaign performance data to identify optimization opportunities, delivering actionable insights to improve outreach effectiveness
• Integrated Zapier and Omnichat with HubSpot to streamline cross-platform data flow and reduce manual intervention
• Generated leads and scheduled client appointments using CRM automation workflows.
• Provided virtual assistance to remote teams by managing client pipelines, reporting, and automated follow-ups.
Data Analyst2024 – 2026
Freelance | Remote
• Performed statistical analysis and trend identification using Python (pandas, NumPy) and SQL on structured client datasets
• Built interactive dashboards and automated reporting systems for real-time KPI and performance monitoring
• Designed and executed ETL workflows — extracting, cleaning, transforming, and loading data for analysis and reporting
• Applied A/B testing frameworks and data-driven decision-making methodologies to deliver actionable client insights
• Delivered data storytelling reports to communicate findings clearly to non-technical stakeholders
• Managed structured client datasets, performing data entry, cleaning, and validation for operational and CRM purposes.
Financial Operations Associate2022 – 2025
Riva Supermarket & Restaurant
• Managed financial transactions with 100% reconciliation accuracy using QuickBooks Online
• Maintained detailed AP/AR records, cash flow tracking, and financial reporting aligned with compliance requirements
• Supported internal audit processes and produced periodic financial summaries for management review
• Resolved customer payment discrepancies efficiently, maintaining high satisfaction and operational continuity
TECHNICAL PROJECTS
AI Response Evaluation Tool
• Built a structured evaluation system to score LLM outputs across accuracy, reasoning quality, hallucination rate, and safety compliance
• Implemented rubric-based scoring metrics aligned with RLHF and preference ranking methodologies
• Developed a lightweight interface for systematic A/B response comparison and adversarial test logging
• Stack: Python, Google Sheets, structured evaluation templates
Data Pipeline & Analytics Project
• Designed an end-to-end ETL pipeline to extract, clean, transform, and analyse structured datasets
• Used Python (pandas, NumPy) and SQL for data processing, validation, and aggregation
• Generated actionable insights through automated dashboards and scheduled reporting tools
Adversarial Prompt Testing Lab
• Created a controlled environment for red-team testing of AI models using edge-case and adversarial prompts
• Logged and categorised failure patterns including hallucinations, bias, reasoning errors, and safety violations
• Produced structured test case libraries to support iterative model evaluation workflows
CRM Workflow Automation Simulator
• Simulated a full HubSpot-style lead management system with automated sequences, lead scoring logic, and pipeline tracking
• Integrated qualification rules, response triggers, and follow-up automation to reduce manual touchpoints
• Demonstrated end-to-end CRM workflow design applicable to sales and marketing operations roles
SQL Data Analysis Dashboard
• Developed multi-table SQL queries for data extraction, aggregation, and performance reporting
• Structured datasets for efficient querying and created dashboards for real-time decision-making support
Annotation Quality Checker Tool
• Built a rule-based validation tool to detect annotation inconsistencies, formatting errors, and labelling gaps
• Automated QA checks to improve dataset reliability and reduce manual review effort across annotation pipelines
CRM Lead Management & VA Project
• Simulated full lead generation and appointment scheduling system in HubSpot CRM
• Automated workflows and email sequences to reduce manual tasks
• Integrated dashboards for SLA tracking and lead conversion monitoring
KEY ACHIEVEMENTS
• Achieved and maintained 97% QA quality score across 3,500+ AI training tasks at Remotasks
• Promoted to QA Reviewer in recognition of consistently high accuracy and reliability
• Reduced CRM data errors by 40% at Crezivanta through validation systems and pipeline hygiene
• Maintained 100% SLA compliance across all remote communication and operational pipelines
• Maintained 100% financial reconciliation accuracy across multi-year cashier and financial operations role
EDUCATION & CERTIFICATIONS
B.Sc. Computer ScienceFocus: Networking & Systems
CPA-K — In Progress
Certifications
• Inbound Sales Certification — HubSpot Academy (2026)
• Advanced Data & Spreadsheet Skills Certification
• Google Data Analytics Certificate (Recommended — aligns directly with current role)
TOOLS & TECHNOLOGIES
Programming & Data: Python (pandas, NumPy, Matplotlib, API Development – FastAPI basics) • SQL • Git (Fundamentals)
Spreadsheets & BI: Google Sheets (Advanced) • Google Looker Studio • Tableau (Fundamentals) • Power BI (Fundamentals)
CRM & Automation: HubSpot CRM • Zendesk • Zapier • Make (Integromat) • Omnichat • Slack
Annotation & Evaluation: Label Studio • Scale AI Interface • Remotasks Platform • SuperAnnotate (Familiarity)
Financial & ERP: QuickBooks Online • NetSuite
Cloud Platforms: AWS (EC2, S3 – Basics) • Google Cloud (BigQuery, Vertex AI – Basics)
Customer Support / VA: Zendesk, Omnichat, Slack
Lead Gen / Appointment Setting: HubSpot, CRM workflows, automated email sequences
Data Entry / Validation: Google Sheets (Advanced), Excel, ETL pipelines