|
META TOPICPARENT |
name="MLRoad-map" |
Overview |
|
- Flag boguses with light curve observations greater than 2.
- Catch variable stars with isdiffpos=True and false in the light curve
- Implement proper grid search over RF parameter space
|
|
> > |
- finish the work with the alternate test set, and getting stats on that.
- make sure imputation is happening correctly
- get Tomas's other plots integrated
- KL divergence code, should be updated.
|
|
- Pipeline Analysis to automate:
- Score improvement on known false positive, negatives (Ragnhild's List)
- KL divergence between training set, test set features (to find major divergence)
- Plot feature distributions on reals vs. boguses (Tiara's code)
|
|
< < |
-
- What do low RB reals look like, what do high RB bogus look like?
- Unlabeled Data
- Score bias per features
|
> > |
-
- Classifier error analysis: What do low RB reals look like, what do high RB bogus look like?
|
|
-
- Report feature importance by correlated feature groups
Data Collection
- Automated Data Contamination [ Charlotte ]
|