Sources/EU/EUR-Lex
EU legislation from the Publications Office CELLAR database. Regulations, directives, decisions, recommendations, and international agreements. Article-level granularity with chapter/section hierarchy. Year-by-year from 1952 via SPARQL.
Status: active
Source
[edit | edit source]| Key | Kind | Origin | Auth | Update |
|---|---|---|---|---|
eurlex |
legislation + section | SPARQL + CELLAR download — publications.europa.eu/webapi/rdf/sparql |
none | incremental (year-by-year) |
ID format:
- Text root:
eu.eurlextext{celex.lower()}:{lang}(e.g.eu.eurlextext32016r0679:fr) - Article:
eu.eurlexarti{celex.lower()}_{article_number}:{lang} - Chapter/section:
eu.eurlexscta{celex.lower()}_{html_id}:{lang}
Coverage: 1952–present
Tags
[edit | edit source]Classification
[edit | edit source]| CELEX nature code | → nature
|
|---|---|
| R | REGLEMENT
|
| L | DIRECTIVE
|
| D | DECISION
|
| H | RECOMMANDATION
|
| X | ACCORD_INTERNATIONAL
|
type = statute (always). number derived from CELEX pattern (e.g. 2016/679).
Enforcement
[edit | edit source]enforcement_status |
in_force |
Signal |
|---|---|---|
in_force |
true | SPARQL in-force signal present |
repealed |
false | end_of_validity is past |
Only these two values are emitted (full vocabulary defined in TAG-CONVENTIONS but not used by this source).
needs_review = "enforcement_status_missing" (no signal) or "enforcement_status_contradiction" (in-force=true but end-of-validity is past).
Structure
[edit | edit source]| Record type | kind | Notes |
|---|---|---|
| Root text node | section | structure_type: "legislation", text_id = cross-language canonical ID
|
| Chapter | section | parent = text root |
| Sub-section | section | parent = chapter |
| Article | legislation | parent = nearest enclosing section |
| Preamble | legislation | article_number: "PREAMBULE"
|
Identifiers
[edit | edit source]| Tag | Notes |
|---|---|
celex |
CELEX identifier |
text_id |
cross-language canonical work ID (no :lang suffix)
|
Multilingual
[edit | edit source]Each language version is a separate row with :{lang} suffix. translation_pending set when content was fetched in EN fallback.
Edges
[edit | edit source]kind |
Direction | Trigger |
|---|---|---|
language_variant |
EN ↔ FR | post-ingest LLM translation enrichment |
Related sources
[edit | edit source]| Source | Relationship |
|---|---|
| cjeu | CJEU decisions interpret EU legislation. No cross-source edges yet. |
Limitations
[edit | edit source]- Only
language="fr"ingested — multi-language support planned - Only R, L, D, H, X nature codes — other CELEX types not ingested
- No citation edges between EU legislative acts
- Legacy/PDF format documents produce a single article (no structural breakdown)