Pinnipedia is a new Berlin startup building a cloud platform that automates and assists the creation of audit-ready IT-security concepts (e.g., BSI-Grundschutz, C5). We’re IGP-funded (2025/26) and co-develop with FU Berlin and pilot users from industry and security consulting. We’re hiring an AI Engineer to turn messy inputs into structured knowledge and reliable answers. Your Mission -Own the end-to-end pipeline that turns unstructured documents into a validated, queryable knowledge graph. Accountable for extraction quality, graph integrity, and the data layer that backs the product's read path. Tasks • LLM extraction pipelines -document chunking, property and relationship extraction, cross-chunk reconciliation, gap detection. Built with structured-output LLM agents orchestrated by durable workflows. • Knowledge graph -schema design as typed Pydantic models, Cypher access patterns and indexing strategy, graph operations, schema evolution and migration. Scope ends at the graph boundary: API contracts and query abstractions exposed to consumers belong to the full-stack engineer. • Deterministic rule engines -table-driven evaluators for cases where code beats LLM judgment; clear contracts between deterministic and probabilistic components. • Data validation & quality -schema enforcement, required-property contracts, audit trails, eval harnesses (expert review, unsupervised checks, synthetic fixtures, LLM-as-judge). • Live data ops -backfills, coordinated migrations across relational + graph stores, observability on extraction throughput and quality, incident response. Requirements Must-have 5+ years shipping data/AI systems to production with real customers -has been on-call for live pipelines and knows what breaks at 2am. Strong Python (typed, modern) and SQL. Comfortable with PostgreSQL under load. Production experience with at least one graph database (Neo4j preferred; Neptune, ArangoDB, TigerGraph acceptable) -schema design, query tuning, not toy use. Production LLM pipeline experience: structured output, agent orchestration, prompt and version management, evaluation frameworks. PydanticAI, LangChain, DSPy, or Instructor all welcome. Durable workflow orchestration in production (DBOS, Temporal, Airflow, Prefect, Dagster). Test-first discipline -integration tests against real datastores (Testcontainers or equivalent), not mock-heavy unit tests. Fluent English skills. Nice-to-have Experience with regulated, compliance-driven, or standards-heavy extraction domains (legal, medical, financial, security/audit). Designed deterministic evaluators alongside LLM components and knows when to reach for which. Contributions to data contracts, schema governance, or ontology work. German language skills. Benefits Hybrid, full-time with flexible scheduling; occasional on-site days in Berlin. Competitive salary: 60.000–85.000 € base (more for exceptional senior profiles). Small, focused team; direct collaboration with the Product Owner and Full-Stack Engineer. Modern tooling, real ownership, and a learning budget for role-relevant training. Impact: help SMEs meet rising security requirements with less friction. Apply on JOIN with your CV (PDF) and a short note (max 200 words ) describing how you would design a KG-backed RAG pipeline (ontology scope, indexing, retrieval, and evaluation you’d use). Process: 20-min intro → 90-min practical (graph modeling + retrieval evaluation) → 45-min team chat → references. We review applications within 5 business days . Pinnipedia Technologies GmbH is a Berlin startup building a cloud platform that automates and assists the creation of audit-ready IT-security concepts for German/EU standards (e.g., BSI-Grundschutz, C5). Our product combines LLMs, a domain knowledge graph, and workflow automation to turn scattered inputs (policies, asset lists, audits) into draft security concepts, checklists, risk/impact matrices, and evidence that stands up in audits. We are IGP-funded (2025/26) and co-develop with FU Berlin and pilot users from industry and security consulting. By Aug 2026 we aim to ship a multi-tenant SaaS with hardened cloud setup, CI/CD, quality dashboards and a go-to-market kit. We work as a small, hands-on team with a modern stack, pragmatic documentation, and a focus on reliable, compliant outcomes for SMEs.
Senior Ai / Knowledge Graph Engineer (M/F/D)
JOIN
Berlin, Berlin
Veröffentlicht vor HeuteNeu
Job melden