| ID | lcf-dsc-002 |
| Asset type | dataset |
| Access Tier | open |
| License | Mixed |
| TLP | TLP:CLEAR |
| Producer | CIRCL |
| Upstream Sources | Vulnerability-Lookup |
| Access Method | HuggingFace Dataset page — direct Parquet download, Python libraries (datasets, pandas, Polars). No account required. |
| Lineage | lcf-ds-001 (derived-from) |
| Format | Parquet |
| Consumption Mode | standalone |
| Required Tooling | n/a |
| Quality | automated |
| Frequency | periodic |
| Volume | 573k rows |
| AI Lifecycle Role | training-data; fine-tuning-data; inference-input |
| Intended Use | NLP research, text generation, language model fine-tuning |
| Known Limitations | Unlabeled subset. Multi-language content |
| DCAT-AP Class | dcat:Dataset |