Phase 3 - CR Hyperparameters & Variants
Created on September 30 | Last edited on October 13
We first run experiments on the trimmed set and then check whether those insights generalise to the main set.
Sweep: Vary Loss Scales
Run set (5 runs)
Varying the loss scales does not seem to make any difference; I still suspect this factor has no influence, so it's probably best to keep them equal. Next, we vary the learning rate.
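Before moving on: the loss scales swept here presumably weight the individual task losses before they are summed. A minimal sketch of that pattern (the `cr`/`pr` term names are assumptions inferred from the cr/crpr run naming, not the actual codebase):

```python
# Sketch: "loss scales" as weights on individual task losses.
# Term names (cr/pr) are hypothetical, inferred from the cr/crpr run names.
def total_loss(cr_loss: float, pr_loss: float,
               cr_scale: float = 1.0, pr_scale: float = 1.0) -> float:
    # Equal scales (what we keep, per the finding above) reduce this
    # to a plain sum of the two losses.
    return cr_scale * cr_loss + pr_scale * pr_loss
```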
Sweep: LR, ELR
Here we vary the learning rate (LR) and the encoder learning rate (ELR); a sweep-config sketch follows the value list.
LR: 2e-5, 4e-5, 1e-4
ELR: 1e-5, 5e-5, 2e-6
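A grid over these values can be expressed as a W&B sweep. A minimal sketch, with an assumed metric key (`avg_f1`), a hypothetical project name, and a stub in place of the real training entry point; note the run set below shows 16 runs, so the actual grid likely included more values than the nine combinations listed above:

```python
import wandb

sweep_config = {
    "method": "grid",
    "metric": {"name": "avg_f1", "goal": "maximize"},  # assumed metric key
    "parameters": {
        "lr": {"values": [2e-5, 4e-5, 1e-4]},
        "encoder_lr": {"values": [1e-5, 5e-5, 2e-6]},
    },
}

def train():
    run = wandb.init()
    # ... real training would read run.config.lr / run.config.encoder_lr ...
    run.log({"avg_f1": 0.0})  # placeholder
    run.finish()

sweep_id = wandb.sweep(sweep_config, project="phase3-cr")  # hypothetical project
wandb.agent(sweep_id, function=train)
```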
Run set (16 runs)
From this, the best-performing combinations are as follows, topped by LR = 1e-3 with ELR (encoder learning rate) = 5e-5:
| LR | Encoder LR | Avg F1 |
|---|---|---|
| 1e-3 | 5e-5 | 0.4578 |
| 5e-4 | 5e-5 | 0.4497 |
| 2e-4 | 1e-5 | 0.4268 |
| 5e-4 | 1e-5 | 0.4208 |
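For reference, a separate encoder learning rate is typically implemented with optimizer parameter groups. A minimal PyTorch sketch, assuming the model splits into an encoder and a task head (both modules here are hypothetical stand-ins), using the best combination from the table:

```python
import torch

# Hypothetical stand-ins for the actual encoder and task-head modules.
encoder = torch.nn.Linear(768, 768)
head = torch.nn.Linear(768, 2)

# One parameter group per learning rate: LR = 1e-3, ELR = 5e-5 (table above).
optimizer = torch.optim.AdamW([
    {"params": encoder.parameters(), "lr": 5e-5},  # encoder LR (ELR)
    {"params": head.parameters(), "lr": 1e-3},     # task-head LR
])
```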
Another thing to note: all of the `crpr` runs are worse than the `cr` runs. That can't be a good sign!
Sweep: CRPR/CR Trim Baselines (default params, varying size)
LR Full Dataset (Sweep LR Results)
Clearly, the 'default' learning rates (LR = 2e-4, ELR = 1e-5) work better than every other combination. So, NO, the insights from the limited 50-instance experiments do not directly translate to the main, whole dataset.
So we need to figure out how large a trimmed dataset has to be before conclusions drawn from it actually carry over to the full set.
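One way to run that check is to evaluate the same configuration over increasing trim sizes until its conclusions match the full-dataset run. A minimal sketch; the sizes and the `run_experiment` stub are hypothetical:

```python
import random

def trim(dataset, size, seed=0):
    # Fixed seed so every trim size draws a reproducible subset.
    rng = random.Random(seed)
    return rng.sample(dataset, size)

def run_experiment(subset):
    # Placeholder for the actual train + eval call on a trimmed set.
    print(f"would train/evaluate on {len(subset)} instances")

full_dataset = list(range(2000))  # placeholder for the real instance list

# Hypothetical sizes, starting from the 50-instance trims used so far.
for size in [50, 100, 200, 500, 1000]:
    run_experiment(trim(full_dataset, size))
```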
Run set (5 runs)
# Unary hdim
The test is inconclusive; the trims still don't represent the real deal. A `unary_hdim` of 1000 works better, but there's little difference between 500 and 100 🤷
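For context, `unary_hdim` is presumably the hidden size of the unary (per-span) scoring FFNN. A minimal sketch of such a scorer; the two-layer structure and dimensions are assumptions, not the actual model code:

```python
import torch

def unary_scorer(input_dim: int = 768, unary_hdim: int = 1000) -> torch.nn.Module:
    # Two-layer FFNN producing one score per candidate span.
    return torch.nn.Sequential(
        torch.nn.Linear(input_dim, unary_hdim),
        torch.nn.ReLU(),
        torch.nn.Linear(unary_hdim, 1),
    )

scorer = unary_scorer(unary_hdim=1000)  # the value that worked best above
```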
# HOI Trainer
There is NO difference between the two. Ignore `hoitrainer`.