POS vs NER for Coref

Created on August 25|Last edited on August 30
Comment
﻿
Multidomain - Multitask
Target Coref: DWIE; Aux Source: Ontonotes
So in effect, we're trying to find that when training together, which works better:
Ontonotes POS 
Ontonotes NER
Ontonotes Coref
When the target task is Coref on DWIE dataset
﻿
﻿
B3 Precision
B3 Precision
10203040Step00.20.40.60.81
task_2.names: ["pos"]
task_2.names: ["coref","pruner"]
task_2.names: ["ner"]
B3 Recall
B3 Recall
10203040Step00.20.40.60.81
task_2.names: ["pos"]
task_2.names: ["coref","pruner"]
task_2.names: ["ner"]
DWIE Valid B3 F1
DWIE Valid B3 F1
10203040Step00.20.40.60.81
task_2.names: ["pos"]
task_2.names: ["coref","pruner"]
task_2.names: ["ner"]
Run set456
﻿
﻿
ObservationsNER and POS seem almost the same
Coref in some settings is better, in some settings is worse
These are just the first experiments. We have not done hyperparameter optimisation, or error analysis just yet. They're up next.
﻿
Single Domain - Multitask - TRIMNOTE: This experiment is done with only 50 instances in each dataset for quick turnaround time.
Dataset: DWIE﻿
﻿
Run set456
﻿
﻿
ObservationsThere seems to be little evidence here that NER is more beneficial than POS, infact. 
Dataset: OntonotesNOTE: This experiment is done with only 50 instances in each dataset for quick turnaround time.
﻿
﻿
Run set4
﻿
﻿
Add a comment