Sources/EU/EUR-Lex

From Dura Lex Wiki
Jump to navigation Jump to search

EU legislation from the Publications Office CELLAR database. Regulations, directives, decisions, recommendations, and international agreements. Article-level granularity with chapter/section hierarchy. Year-by-year from 1952 via SPARQL.

Status: active

Source

[edit | edit source]
Key Kind Origin Auth Update
eurlex legislation + section SPARQL + CELLAR download — publications.europa.eu/webapi/rdf/sparql none incremental (year-by-year)

ID format:

  • Text root: eu.eurlextext{celex.lower()}:{lang} (e.g. eu.eurlextext32016r0679:fr)
  • Article: eu.eurlexarti{celex.lower()}_{article_number}:{lang}
  • Chapter/section: eu.eurlexscta{celex.lower()}_{html_id}:{lang}

Coverage: 1952–present

Classification

[edit | edit source]
CELEX nature code nature
R REGLEMENT
L DIRECTIVE
D DECISION
H RECOMMANDATION
X ACCORD_INTERNATIONAL

type = statute (always). number derived from CELEX pattern (e.g. 2016/679).

Enforcement

[edit | edit source]
enforcement_status in_force Signal
in_force true SPARQL in-force signal present
repealed false end_of_validity is past

Only these two values are emitted (full vocabulary defined in TAG-CONVENTIONS but not used by this source).

needs_review = "enforcement_status_missing" (no signal) or "enforcement_status_contradiction" (in-force=true but end-of-validity is past).

Structure

[edit | edit source]
Record type kind Notes
Root text node section structure_type: "legislation", text_id = cross-language canonical ID
Chapter section parent = text root
Sub-section section parent = chapter
Article legislation parent = nearest enclosing section
Preamble legislation article_number: "PREAMBULE"

Identifiers

[edit | edit source]
Tag Notes
celex CELEX identifier
text_id cross-language canonical work ID (no :lang suffix)

Multilingual

[edit | edit source]

Each language version is a separate row with :{lang} suffix. translation_pending set when content was fetched in EN fallback.

Edges

[edit | edit source]
kind Direction Trigger
language_variant EN ↔ FR post-ingest LLM translation enrichment
[edit | edit source]
Source Relationship
cjeu CJEU decisions interpret EU legislation. No cross-source edges yet.

Limitations

[edit | edit source]
  • Only language="fr" ingested — multi-language support planned
  • Only R, L, D, H, X nature codes — other CELEX types not ingested
  • No citation edges between EU legislative acts
  • Legacy/PDF format documents produce a single article (no structural breakdown)