Upload folder using huggingface_hub

Files changed (9)

README.md +177 -0
added_tokens.json +3 -0
config.json +43 -0
model.safetensors +3 -0
special_tokens_map.json +15 -0
spm.model +3 -0
tokenizer.json +0 -0
tokenizer_config.json +59 -0
training_args.bin +3 -0

README.md ADDED

	@@ -0,0 +1,177 @@

+---
+language: en
+license: mit
+library_name: transformers
+tags:
+- text-classification
+- character-analysis
+- plot-arc
+- narrative-analysis
+- deberta-v3
+- binary-classification
+datasets:
+- custom
+metrics:
+- accuracy
+- f1
+model-index:
+- name: plot-arc-classifier
+  results:
+  - task:
+      type: text-classification
+      name: Character Plot Arc Classification
+    dataset:
+      type: custom
+      name: Character Arc Dataset
+    metrics:
+    - type: accuracy
+      value: 0.796
+      name: Accuracy
+    - type: f1
+      value: 0.796
+      name: F1 Score (Strong Class)
+    - type: precision
+      value: 0.777
+      name: Precision (Strong Class)
+    - type: recall
+      value: 0.816
+      name: Recall (Strong Class)
+base_model: microsoft/deberta-v3-xsmall
+---
+# Plot Arc Character Classifier
+A DeBERTa-v3-XSmall model fine-tuned to classify fictional characters based on their plot arc potential.
+## Model Description
+This model classifies character descriptions into two categories:
+- **STRONG** (label 1): Characters with both internal conflict and external responsibilities/events
+- **WEAK** (label 0): Characters with no plot arc, pure internal conflict only, or pure external events only
+Compared with earlier versions, this model corrects a bias in which simple background characters (shopkeepers, guards) were incorrectly classified as plot-significant.
+## Training Data
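+- **Dataset Size**: 11,888 examples, class-balanced (50% STRONG, 50% WEAK)
+- **Training Examples**: 9,510 (about 80%)
+- **Validation Examples**: 2,378 (about 20%)
+- **Source**: Custom dataset of character descriptions from literature, each labeled with one of four classes (NONE, INTERNAL, EXTERNAL, BOTH)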
+### Label Mapping
+- **STRONG (1)**: Characters classified as "BOTH" (internal conflict + external events)
+- **WEAK (0)**: Characters classified as "NONE", "INTERNAL", or "EXTERNAL"
+## Training Details
+- **Base Model**: microsoft/deberta-v3-xsmall (22M backbone parameters, plus about 48M in the embedding layer)
+- **Training Time**: ~15 minutes
+- **Batch Size**: 8, with 2 gradient accumulation steps (effective batch size 16)
+- **Max Sequence Length**: 384 tokens
+- **Learning Rate**: 5e-5 with warmup
+- **Early Stopping**: Yes (training stopped after about 3.7 of 5 planned epochs)
+## Performance
+### Validation Metrics
+| Metric | Score |
+|--------|-------|
+| Accuracy | 79.6% |
+| F1 (Strong) | 79.6% |
+| Precision (Strong) | 77.7% |
+| Recall (Strong) | 81.6% |
+### Synthetic Test Results
+**100% accuracy** on a set of synthetic test cases, including previously problematic examples:
+| Character Type | Example | Prediction | Confidence |
+|----------------|---------|------------|------------|
+| Background (NONE) | Baker, Guard | WEAK ✅ | 98.9%, 98.5% |
+| Pure Internal | Haunted Artist | WEAK ✅ | 93.9% |
+| Pure External | Military Commander | WEAK ✅ | 94.5% |
+| Both (Internal+External) | Conflicted King | STRONG ✅ | 95.1% |
+| Both (Trauma+Mission) | PTSD Captain | STRONG ✅ | 95.5% |
+| Both (Doubt+Quest) | Uncertain Prophet | STRONG ✅ | 96.0% |
+**Key Achievement**: Fixed critical bias where simple background characters were incorrectly classified as plot-significant.
+## Usage
+```python
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import torch
+# Load model and tokenizer. "plot-arc-classifier" is the ID used in this README;
+# substitute the full Hub repo ID or a local path to the uploaded folder.
+tokenizer = AutoTokenizer.from_pretrained("plot-arc-classifier")
+model = AutoModelForSequenceClassification.from_pretrained("plot-arc-classifier")
+def classify_character(description):
+    """Return (label, confidence) for a single character description."""
+    # Truncate to the 384-token limit used during training.
+    inputs = tokenizer(description, return_tensors="pt", truncation=True, max_length=384)
+    with torch.no_grad():
+        logits = model(**inputs).logits
+    probabilities = torch.softmax(logits, dim=-1)
+    predicted_class = torch.argmax(probabilities, dim=-1).item()
+    # id2label is read from config.json: {0: "WEAK", 1: "STRONG"}.
+    label = model.config.id2label[predicted_class]
+    confidence = probabilities[0, predicted_class].item()
+    return label, confidence
+# Test examples
+examples = [
+    "A baker who makes fresh bread daily and serves customers with a smile.",
+    "A warrior haunted by past failures who must lead a desperate battle to save his homeland while confronting his inner demons.",
+]
+for desc in examples:
+    label, conf = classify_character(desc)
+    print(f"'{desc[:50]}...': {label} ({conf:.3f})")
+```
+## Model Improvements
+This model addresses critical issues from previous versions:
+1. **Fixed Bias**: No longer classifies simple background characters as STRONG
+2. **Proper Discrimination**: Requires both internal and external elements for STRONG classification
+3. **Balanced Training**: A 50/50 STRONG/WEAK class split prevents class imbalance issues
+4. **Clean Taxonomy**: Based on proper 4-way character analysis
+## Limitations
+- Trained on English literary character descriptions
+- May not generalize well to other domains (screenwriting, gaming, etc.)
+- Performance may degrade on very short or very long descriptions (inputs beyond the 384-token maximum sequence length are truncated)
+- Cultural bias toward Western narrative structures
+## Ethical Considerations
+This model is designed for narrative analysis and creative writing assistance. It should not be used to make judgments about real people or for any discriminatory purposes.
+## Citation
+If you use this model, please cite:
+```bibtex
+@misc{plot-arc-classifier-2024,
+  title={Plot Arc Character Classifier},
+  author={Generated with Claude Code},
+  year={2024},
+  url={https://huggingface.co/plot-arc-classifier}
+}
+```
+## Training Infrastructure
+- **Framework**: 🤗 Transformers
+- **Hardware**: Apple Silicon (MPS)
+- **Optimization**: Memory-optimized for MPS training
+- **Early Stopping**: Enabled to prevent overfitting
+---
+🤖 Generated with [Claude Code](https://claude.ai/code)
+Co-Authored-By: Claude <[email protected]>
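
The figures reported in the README above can be cross-checked with a few lines of arithmetic. This is a minimal sketch: it recomputes F1 from the reported precision and recall, sums the train and validation counts against the dataset size, and derives the effective batch size. It assumes the reported batch size of 8 is the per-step batch, which the README does not state explicitly. All variable names are illustrative.

```python
# Values copied from the README; variable names are illustrative.
precision, recall = 0.777, 0.816
f1 = 2 * precision * recall / (precision + recall)  # harmonic mean

train_examples, val_examples, dataset_size = 9_510, 2_378, 11_888
val_fraction = val_examples / dataset_size

# Assumption: 8 is the per-step batch size, accumulated over 2 steps.
batch_size, grad_accum_steps = 8, 2
effective_batch_size = batch_size * grad_accum_steps

print(f"F1 from precision/recall: {f1:.3f}")
print(f"train + val: {train_examples + val_examples} (dataset size {dataset_size})")
print(f"validation fraction: {val_fraction:.1%}")
print(f"effective batch size: {effective_batch_size}")
```

The recomputed F1 rounds to 0.796, matching the reported value, and the two splits sum to the stated 11,888 examples.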

added_tokens.json ADDED

	@@ -0,0 +1,3 @@

+{
+  "[MASK]": 128000
+}

config.json ADDED

	@@ -0,0 +1,43 @@

+{
+  "architectures": [
+    "DebertaV2ForSequenceClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 384,
+  "id2label": {
+    "0": "WEAK",
+    "1": "STRONG"
+  },
+  "initializer_range": 0.02,
+  "intermediate_size": 1536,
+  "label2id": {
+    "STRONG": 1,
+    "WEAK": 0
+  },
+  "layer_norm_eps": 1e-07,
+  "legacy": true,
+  "max_position_embeddings": 512,
+  "max_relative_positions": -1,
+  "model_type": "deberta-v2",
+  "norm_rel_ebd": "layer_norm",
+  "num_attention_heads": 6,
+  "num_hidden_layers": 12,
+  "pad_token_id": 0,
+  "pooler_dropout": 0,
+  "pooler_hidden_act": "gelu",
+  "pooler_hidden_size": 384,
+  "pos_att_type": [
+    "p2c",
+    "c2p"
+  ],
+  "position_biased_input": false,
+  "position_buckets": 256,
+  "relative_attention": true,
+  "share_att_key": true,
+  "torch_dtype": "float32",
+  "transformers_version": "4.55.4",
+  "type_vocab_size": 0,
+  "vocab_size": 128100
+}
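
The `id2label` and `label2id` entries in this config line up with the README's label mapping, where the four annotation classes are collapsed into two. A minimal sketch of that collapse is shown below. The helper `to_binary_id` and the constant names are hypothetical and are not part of the uploaded files.

```python
# id2label / label2id as they appear in config.json.
ID2LABEL = {0: "WEAK", 1: "STRONG"}
LABEL2ID = {"WEAK": 0, "STRONG": 1}

# Four-way annotation collapsed to binary, following the README's label mapping.
FOUR_WAY_TO_BINARY = {
    "NONE": "WEAK",
    "INTERNAL": "WEAK",
    "EXTERNAL": "WEAK",
    "BOTH": "STRONG",
}

def to_binary_id(four_way_label: str) -> int:
    """Map a four-way annotation to the model's class ID (hypothetical helper)."""
    return LABEL2ID[FOUR_WAY_TO_BINARY[four_way_label.upper()]]

for name in FOUR_WAY_TO_BINARY:
    print(f"{name} -> {ID2LABEL[to_binary_id(name)]}")
```

Only the "BOTH" class maps to STRONG; the other three map to WEAK.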

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f6ce91fabf1a7eb0bff15a40318d19b56965648198c98e39be216583bd8b4969
+size 283347432
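
The file size in this LFS pointer gives a rough parameter count, which helps reconcile it with the "22M parameters" figure in the README. The sketch below assumes every stored tensor is float32 (per `torch_dtype` in config.json) and ignores the small safetensors header, so the total is a slight overestimate. Variable names are illustrative.

```python
# Figures copied from the diff: LFS pointer size and config.json fields.
file_size_bytes = 283_347_432
bytes_per_param = 4  # assumption: all tensors stored as float32
vocab_size, hidden_size = 128_100, 384

# Slight overestimate: the file also contains a JSON header.
approx_total_params = file_size_bytes // bytes_per_param
embedding_params = vocab_size * hidden_size
approx_other_params = approx_total_params - embedding_params

print(f"approx. total parameters: {approx_total_params / 1e6:.1f}M")
print(f"word embedding parameters: {embedding_params / 1e6:.1f}M")
print(f"approx. remaining parameters: {approx_other_params / 1e6:.1f}M")
```

Under these assumptions the checkpoint holds roughly 70.8M parameters, of which about 49.2M sit in the word embedding matrix. The remaining 21.6M or so is consistent with the README's 22M figure if that figure counts only the transformer backbone.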

special_tokens_map.json ADDED

	@@ -0,0 +1,15 @@

+{
+  "bos_token": "[CLS]",
+  "cls_token": "[CLS]",
+  "eos_token": "[SEP]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": {
+    "content": "[UNK]",
+    "lstrip": false,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  }
+}

spm.model ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c679fbf93643d19aab7ee10c0b99e460bdbc02fedf34b92b05af343b4af586fd
+size 2464616

tokenizer.json ADDED

The diff for this file is too large to render. See raw diff

tokenizer_config.json ADDED

	@@ -0,0 +1,59 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "1": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "2": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "3": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": true,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "128000": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "bos_token": "[CLS]",
+  "clean_up_tokenization_spaces": false,
+  "cls_token": "[CLS]",
+  "do_lower_case": false,
+  "eos_token": "[SEP]",
+  "extra_special_tokens": {},
+  "mask_token": "[MASK]",
+  "model_max_length": 1000000000000000019884624838656,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "sp_model_kwargs": {},
+  "split_by_punct": false,
+  "tokenizer_class": "DebertaV2Tokenizer",
+  "unk_token": "[UNK]",
+  "vocab_type": "spm"
+}
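
The `model_max_length` value in this file equals `int(1e30)`, which appears to be the placeholder Transformers writes when no tokenizer length limit is configured. In practice this means `truncation=True` on its own would not shorten inputs, so callers should pass `max_length` explicitly, as the README usage example does. The sketch below picks a safe limit from the values in this commit. The helper `effective_max_length` and the constant `NO_LIMIT_SENTINEL` are hypothetical names.

```python
# Assumption: int(1e30) is the "no limit configured" placeholder.
NO_LIMIT_SENTINEL = int(1e30)

tokenizer_model_max_length = 1000000000000000019884624838656  # tokenizer_config.json
max_position_embeddings = 512                                 # config.json
training_max_length = 384                                     # README, Training Details

def effective_max_length(tokenizer_limit: int, position_limit: int, training_limit: int) -> int:
    """Return the tightest real limit, ignoring the tokenizer placeholder (hypothetical helper)."""
    limits = [position_limit, training_limit]
    if tokenizer_limit < NO_LIMIT_SENTINEL:
        limits.append(tokenizer_limit)
    return min(limits)

max_length = effective_max_length(
    tokenizer_model_max_length, max_position_embeddings, training_max_length
)
print(f"tokenizer limit is placeholder: {tokenizer_model_max_length == NO_LIMIT_SENTINEL}")
print(f"max_length to pass to the tokenizer: {max_length}")
```

Passing the training length of 384 keeps inference inputs consistent with what the model saw during fine-tuning.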

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3947b09074cc74a0341a06416cf1b03fb6cc4401933e052557f006ccc8f0c9e3
+size 5777