Patrick Johnson
committed on
Update README.md
README.md
CHANGED
@@ -6,10 +6,53 @@ language:
 base_model:
 - google-bert/bert-base-german-cased
 ---
-Model Card for Model ID

-
-Model Details
-Model Description

-
+# Model Card for norygano/C-BERT

+CausalBERT (C-BERT) is a multi-task fine-tuned German BERT that extracts causal attributions
+— identifying INDICATORs and ENTITY spans, then classifying CAUSE/EFFECT relationships.
+
+## Model details
+
+- **Model architecture**: BERT-base-German-cased + token & relation heads
+- **Fine-tuned on**: custom causal attribution corpus (German)
+- **Tasks**:
+  1. Token classification (BIO tags for INDICATOR / ENTITY)
+  2. Relation classification (CAUSE, EFFECT, INTERDEPENDENCY)
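A note on task 1: BIO tagging marks the first token of a span with `B-` and continuation tokens with `I-`, so turning tagger output into INDICATOR and ENTITY spans is a small grouping step. The sketch below is not taken from the `causalbert` package; the function name `bio_to_spans` and the exact tag strings (`B-ENTITY`, `I-ENTITY`, `B-INDICATOR`) are assumptions made for illustration.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (label, text) spans.

    Hypothetical helper: the tag names are assumed, not read from the model config.
    """
    spans: List[Tuple[str, str]] = []
    current_label: Optional[str] = None
    current_tokens: List[str] = []

    def flush() -> None:
        # Close the span that is currently open, if any.
        if current_tokens and current_label is not None:
            spans.append((current_label, " ".join(current_tokens)))

    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        if prefix == "B" or (prefix == "I" and label != current_label):
            # A B- tag opens a new span; a stray I- tag is treated the same way.
            flush()
            current_label, current_tokens = label, [token]
        elif prefix == "I":
            current_tokens.append(token)
        else:
            # "O": outside any span.
            flush()
            current_label, current_tokens = None, []
    flush()
    return spans


tokens = ["Der", "Autoverkehr", "verursacht", "Bienensterben", "."]
tags = ["B-ENTITY", "I-ENTITY", "B-INDICATOR", "B-ENTITY", "O"]
spans = bio_to_spans(tokens, tags)
print(spans)
```

Treating a stray `I-` tag as a span start is one common convention; a stricter decoder could drop such tokens instead.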
+
+## Usage
+
+```python
+from transformers import AutoTokenizer
+from causalbert.infer import load_model, analyze_sentence_with_confidence
+
+model, tokenizer, config, device = load_model("norygano/C-BERT")
+result = analyze_sentence_with_confidence(
+    model, tokenizer, config, "Autoverkehr verursacht Bienensterben.", []
+)
+```
+
+## Training
+
+- **Base model**: `google-bert/bert-base-german-cased`
+- **Epochs**: 3, **LR**: 2e-5, **Batch size**: 8
+- See [train.py](https://github.com/norygano/causalbert/blob/main/causalbert/train.py) for details.
+
+## Limitations
+
+- Only German.
+- Sentence-level; doesn’t handle cross-sentence causality.
+- Relation classification depends on detected spans—errors in token tagging propagate.
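The last limitation can be made concrete with a toy pipeline. Assume, purely for illustration, that relation classification runs on every (indicator, entity) pair built from the tagger's spans. An entity the tagger misses then never reaches the relation head. The helper `relation_candidates` and the `(label, text)` span format are hypothetical, not the `causalbert` API.

```python
from typing import List, Tuple

Span = Tuple[str, str]  # (label, text); hypothetical span format


def relation_candidates(spans: List[Span]) -> List[Tuple[str, str]]:
    """Pair every INDICATOR span with every ENTITY span."""
    indicators = [text for label, text in spans if label == "INDICATOR"]
    entities = [text for label, text in spans if label == "ENTITY"]
    return [(ind, ent) for ind in indicators for ent in entities]


# Tagger output with both entities found.
correct = [("ENTITY", "Autoverkehr"), ("INDICATOR", "verursacht"), ("ENTITY", "Bienensterben")]
# Tagger output where the second entity was missed.
missed = [("ENTITY", "Autoverkehr"), ("INDICATOR", "verursacht")]

full_pairs = relation_candidates(correct)
partial_pairs = relation_candidates(missed)
print(len(full_pairs), len(partial_pairs))
```

With the missed span, the pair involving "Bienensterben" is never generated, so no downstream classifier can recover that relation.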
+
+## References & Source
+
+- 🔗 GitHub: https://github.com/norygano/causalbert
+- 🤗 Hub: https://huggingface.co/norygano/C-BERT
+---
+library_name: transformers
+license: apache-2.0
+language:
+- de
+base_model:
+- google-bert/bert-base-german-cased
+---