Reports
Created by
Created On
Last edited
Reproducing Results for Open-Source, PyFlex
Only ToolFlowNet and Direct Vector MSE, for both variants of PourWater (3D and 6D action spaces). Uses PyFlex, which is what we can release!
0
2023-02-03
Reproducing Results for Open-Source, PyFlexRobotics
Only ToolFlowNet and Direct Vector MSE (an older version), for both variants of PourWater (3D and 6D action spaces). Uses PyFlexRobotics, NOT PyFlex.
0
2023-01-31
Rebuttal: RGBD Results (and Segm Depth)
Now we have RGBD input (woo hoo) and can test them. This report has them for all three versions (pouring, and 4d and 6d scooping). Note: RGBD for here, not RGB (unless strictly for comparison purposes). UPDATE: actually now I feel like Depth-Segmentation will give better comparisons.
0
2022-08-18
SAC Results on Scooping
For both 3D and 6D action spaces, for both scooping tasks. For both keypoints and images (the latter means CURL). Also keypoints == state information.
0
2022-08-02
BC04 Multi-Sphere Results (4DoF Demo Data)
4DoF demo data (again, trying now). See other report for 3DoF demo data.
0
2022-07-13
Algorithmic Demonstrator Training Runs
This a summary of the training results for ToolFlowNet, when trained on the newer algorithmic (translation-only) demonstrator.
0
2022-07-20
BC04: Data Augm (Scooping/MMOneSphere)
After CoRL, trying more data augmentation using the 50 demo case.
0
2022-07-13
BC04 (Pouring) SVD Flow-Based Methods
This should be for our 'proposed' 3D and 6D flow methods; ablations can go elsewhere.
0
2022-06-10
Multi-Sphere, Comparing Point Clouds
Now with updated data statistics so we can track how often we are getting one set of balls over another, etc. (Edit: investigating again in mid-July but maybe make a new report as it's harder to group runs, for some reason.)
0
2022-06-08
Physical Experiments, June 22+, Translation-Only
Trying to get the flow methods to work the best on translation-only data for scooping. So far we have something that shows some promise. However, it still feels unreliable and a reviewer who is watching the videos may not be impressed.
0
2022-06-22
BC04 (Pouring and Scooping) Dense Transformation
Might as well combine these together? Not sure we have many things to talk about here.
0
2022-06-12
Pouring, Reduce Tool Experiments
Now we just have 5 such points in the point cloud! (Edit: now 10)
0
2022-06-10
BC03 Scooping 3D Flow Ablation: MSE after SVD (not pointwise)
This is an ablation for 3D flow, to be clear, not 6D flow in case we are using 3D flow as the method.
0
2022-06-12
Physical Datasets (Naive Methods, Translation Data v03)
Focusing on training of baseline methods here. Just do translation data.
0
2022-06-12
BC03 4DoF, 1 Sphere, PN++ SVD Pointwise 6D flow, ee2flow, 100 demos
Finally moving towards 4D manipulation. NOTE: will also put some ablations here. This will focus on 6D flow, to be clear.
0
2022-05-19
Physical Datasets (Naive Methods, Translation Data v02)
Focusing on training of baseline methods here. Just do translation data.
0
2022-06-11
BC04 (Pouring) Naive CNN Methods
Check (a) the new experimental setting, (b) scaling / no scaling variants.
0
2022-06-10
BC03, 1 Sphere, Ablations
(1) No skip connections, (2) pointwise before SVD, (3) using dense loss
0
2022-06-05
Pouring, Compare # of Demos
How does performance vary as a function of demonstration count?
0
2022-06-03
BC03, 1 Sphere, Noise to PCL
Experiment with adding noise to point cloud. See here for visuals on what extra noise means to the point clouds, etc.
0
2022-06-06
Pouring, Naive Baselines (Pointwise with Classification PN++)
Why is this baseline so awful? We really have to investigate, I think.
0
2022-06-04
Pouring, SVD Methods (Ablation Test)
This is more of an ablation where we compare pointwise vs MSE losses.
0
2022-06-04
Pouring, Dense Transformation Methods
Let's try the dense transformations methods. It seems like this will be giving slightly better performance relative to pouring?
0
2022-06-05
BC03 4DoF, 1 Sphere, Baseline Methods (PN++ and CNN)
These are baseline methods that we hope do not do well. :). UPDATE: also adding the naive PN++ baseline but where we use pointwise loss! That will tell us if pointwise is sufficient, or if we need more. (UPDATE 06/02/2022, running again, see notes here)
0
2022-05-20
BC03 4DoF, 1 Sphere, # of demos
Check to see how much performance varies based on # of demos.
0
2022-05-27
Pouring, Naive Vector Methods, data v02
Now we're going to deal with pouring as our second environment. This has a bunch of the 'naive' baselines.
0
2022-06-01
BC03 4DoF, 1 Sphere, Dense Transformation
This is the simple dense transformation we've been talking about.
0
2022-05-27
BC03 4DoF, 1 Sphere, ee2flow, 3D flow
Using 3D flow. Testing with and without consistency. Objective: compare 3D flow with 6D flow and see if 3D is worse (due to coupling between the rotation and translation).
0
2022-05-27
BC03 3DoF, 1 Sphere, PN++ Naive EE
Classification PN++. Testing with and without the older data.
0
2022-05-13
BC03 3DoF, 1 Sphere, PN++ SVD Pointwise Tool Flow (after bugfix).
Has 15 trials with and without the consistency loss (and, w/out consistency but with OLDER v02 3DoF data). After the 05/16 bug fix where we properly scale `data.pos`.
0
2022-05-16
BC03 3DoF, 1 Sphere, Compare Effect of #Demos
For different methods, compare the effect of # of training data points.
0
2022-05-20
BC03, 3DoF, 1 Sphere, Dense Transform
Trying out the simplest Dense Transform possible. NOTE: this is using 3DoF dense transform so it is translation only.
0
2022-05-27
BC03 3DoF, 1 Sphere, PN++ averaging layer (debugging, prior data)
Using older data (ladle_algorithmic_v02) to debug
0
2022-05-16
MMOneSphere: BC_PCL_PNet2_ee
This is the 'naive' way of using 'classification' PN++ to regress directly to the 3D position delta, but it seems to do a good job here despite some strange validation MSEs (sometimes they are spiking upwards). [Edit: that's because of dropout...]
0
2022-04-20
BC03 3DoF, 1 Sphere, RGB CNN Baseline (with and without data aug)
Naive method, however I expect it to work well on the translation only case. I'm testing with and without data augmentation, the data augm uses randomized image crops.
0
2022-05-14
BC03 3DoF, 1 Sphere, 6D Flow
At test time, we just keep the first 3 dimensions of flow (which are translations) and run the average of it.
0
2022-05-16
BC03 3DoF, 1 Sphere, PN++ Per-Point Flow then Avg
The average is NON differentiable, just something we do after the fact.
Also unfortunately we can't look at the evaluation flow plots because those were not wrapped in the `utils.eval_mode()` and thus have the dropout making it look weird. :( Might want to re-do these, then.
0
2022-05-14
BC03 3DoF, 1 Sphere, PN++ Pointwise Flow no consistency (debugging, prior data)
Using older data (ladle_algorithmic_v02) to debug.
0
2022-05-16
BC #02: MMOneSphere, ntrain100, PNet2_svd_pointwise_flow
3DoF demonstrators, testing pointwise loss using `flow` instead of `ee2flow`, which might be more realistic but may lose information related to collisions.
0
2022-04-30
BC03 3DoF, 1 Sphere, PN++ SVD Pointwise Flow
WITHOUT the consistency loss. NOTE: cannot look at the evaluation flow visualizations or MSEs because that doesn't have the correct model.eval() but the action selection should be the same. Unfortunately doesn't look that good?
0
2022-05-13
BC03 3DoF, 1 Sphere, PN++ SVD Pointwise Flow
WITH the consistency loss. NOTE: cannot look at the evaluation flow visualizations or MSEs because that doesn't have the correct model.eval() but the action selection should be the same. Unfortunately doesn't look that good?
0
2022-05-13
(05/09/2022) More Overfitting Tests
More tests with overfitting to 1 demonstration, now with (fixed?) flow and rotation. Some promising results and interesting observations, mostly with 4DoF demonstrators.
0
2022-05-09
BC#03 Test Overfitting to 1 Demonstration
Need to see if this can overfit to 1 demonstration. Done before running larger-scale BC#03 experiments, with 10 runs total (5 for 1-sphere, 5 for multi-sphere).
0
2022-05-06
BC #02: MMOneSphere, RGB Baseline with CNN
Hopefully flow does better. Actually it seems like it does? See the comparisons here
0
2022-05-03
BC #02 MMMultiSphere PNet2_svd_pointwise_flow
See if flow can be used with SVD for the multi-sphere case.
0
2022-05-02
BC #02 MMMultiSphere PNet2_avg_ee 3DoF
Averaging layer, which was doing the best with 1 sphere env.
0
2022-05-03
BC #02 MMMultiSphere "Naive" Methods, Regress Straight to 3DoF (use PN++ and RGB tests)
Naive method with classification PN++, regressing to 3DoF pose, now for multi-sphere env. Update: also testing with the RGB network.
0
2022-05-03
BC #02: MMOneSphere, ntrain{100,500}, PNet2_eepose (4DoF demonstrator)
On the newer dataset now regressing straight to `eepose` so it's 6DoF axis angle, and then for test time we end up clearing out the x and z axis for the rotation so it's just the y axis. Looks bad which is good (well, assuming our other methods do better). NOTE: this hack could be an issue because the delta (amount of rotation) depends on the magnitude of the axis-angles. Of course, again the MSEs all had x and z as 0 so if it predicts something there it isn't learning correctly...
0
2022-04-28
BC #02: MMOneSphere, SVD pointwise, check 4DoF demo, varying ntrain + some debugging
Now checking SVD pointwise on the 4DoF demonstrator! Need to also carefully check the demonstrator since I had some bugs in my earlier code with rotations.
0
2022-05-02
BC #02: MMOneSphere, ntrain100, PNet2_svd_pointwise_ee2flow
I actually have 7 results here, 1 from Eddie, 3X runs from me 04/27, and 3X more runs 04/29 (should be the same, I was debugging something but I think all 7 are under same experimental settings). These show pointwise SVD, but we are still using a 3DoF action space. This means the output of the rotation is ignored (see notes below).
0
2022-04-28
BC #02: MMOneSphere, ntrain100, PNet2_avg_ee
Try the naive averaging layer this time using the new BC dataset. Remember, this was the one that did really well, but only for the translation-only data, so we hope to do at least as good as this with SVD if we have no rotations in the demonstrator. UPDATE: yes, results are really good! Now if only we can get SVD to do as well as this. :X
0
2022-04-28
BC #02: MMOneSphere, "Naive" Method, Regress Straight to 3DoF
This report will analyze the "naive" way of BC with 3DOF data / actions: we take the segmented point cloud, pass it as input to a classification PN++, and then do MSE straight on the 3DoF translation. I also used this to debug / investigate MSEs. I also test with 100 and 500 training demos.
0
2022-04-28
BC #02: MMOneSphere, ntrain100, PNet2_svd_pointwise_flow
This is now doing SVD pointwise on the 4DOF demonstrator! So we actually have to use flow here, not `ee2flow` because `ee2flow` is only if we can copy the ee delta transitions equally to all the flow vectors, and we can't do that with >3DOFs. Unfortunately, the results here aren't that good ... so we're going to have to brainstorm ways to improve on this (or maybe adjust the demonstrator, how the policy produces and implements actions, etc.).
0
2022-04-29
MMOneSphere: BC_PCL_PNet2_flow2ee
Now this is just classification PointNet++, instead of regressing straight to EE delta, we regress to the flow which has been averaged to the EE delta. This is OK since there are no rotations here. But now this has a lot of clipped values so I expect performance to be a bit lower compared to the `BC_PCL_PNet2_ee` case.
0
2022-04-21
MMOneSphere: BC_PCL_PNet2_ee2flow
This is the raw flow, so we are: (a) doing MSE per-point separately on all tool points, (b) get the 'action' from the agent by averaging the output. Not working that well, results seem consistent with what I've seen previously.
0
2022-04-20
MMOneSphere: BC_PCL_PNet2_avg_ee
Here we are now backpropagating through an averaging layer, so this is using segmentation PN++ but regressing to a single EE via an average at the end (just 3D position change). Surprisingly this seems to work very well. In fact these learning curves look REALY good.
0
2022-04-20
MMOneSphere: BC_PCL_PNet2_avg_flow2ee
Try the backprop-through-average. Compare this with `BC_PCL_PNet2_avg_ee` the only difference is whether we use `flow2ee` (here) or the raw ee, which reflects the effect of collisions. Since we use flow, I am guessing performance should be a bit worse. (Edit: maybe not! It looks good here!) Remember, here we have 3D flow per point, and since demonstrator just did translation, that turns into our "EE position change" as averaging all the flow means we get the same thing.
0
2022-04-20
MMOneSphere: BC_PCL_PNet2_flow
Segmentation version of PN++, regressing onto flow. Challenges: (a) MSE is per-point, (b) flow loses collision info. So this is not expected to work well, and is mainly a baseline. (Update: as expected, pretty bad performance)
0
2022-04-21
MMOneSphere: BC_PCL_PNet2_svd_ee
Now we're running through the differentiable SVD layer for the first time. We'll use EE as targets, and the MSE is applied on both the translation and the quaternion, though the quaternions are all the identity here. The quaternion part should enforce some consistency with the flow predictions, right? (Update: results do show some promise! But they need to be better...)
0
2022-04-21
MMMultiSphere (v02): BC_PCL_PNet2_ee
Multi-sphere env with 1 distractor. This is the "naive" way where we have a classification variant of PN++ and we just regress to the target 3D EE position change.
0
2022-04-21
MMMultiSphere (v02): BC_PCL_PNet2_flow2ee
Let's see if flow2ee is a bit worse compared to just ee. This is with the classification PN++. It would be good to get more verification that the action magnitudes could cause a difference in performance (this seems reasonable to me).
0
2022-04-23
MMMultiSphere (v02): BC_PCL_PNet2_ee2flow
Segmentation based PN++ where it outputs flow and where we do point-wise MSE against the ground-truth flow. This is likely not to be that good even if we use "ee" information for the flow. Update: yes it's pretty bad.
0
2022-04-22
MMMultiSphere (v02): BC_PCL_PNet2_avg_ee
Now doing the multi-sphere env with 1 distractor (might be hard). Here we have the segmentation PN++ with an averaging layer so we backprop through the average.
0
2022-04-21
MMMultiSphere (v02): BC_PCL_PNet2_svd_ee
Now running through differentiable SVD layer with the multi-sphere env. Again this is translation-only so the quaternions are kind of irrelevant but we do try and apply a MSE on the quaternions to make the resulting flow map to (1,0,0,0).
0
2022-04-22
MMMultiSphere (v02): BC_PCL_PNet2_avg_flow2ee
To compare with: (a) PN++ with classification flow2ee, so to see effect of the two architectures of PN++, (b) PN++ with average layer but with ee, to see the effect (deterioration?) of using flow instead of ee with the loss of collision information.
0
2022-04-23