Diverse Mini-batch Active Learning #134
base: dev
Conversation
    Returns:
        Indices of the instances from `X` chosen to be labelled
    """
    uncertainty = classifier_margin(classifier, X, **uncertainty_measure_kwargs)
So you only support margin uncertainty? I would suggest adding the uncertainty measure as a callable parameter of the function, defaulting to classifier_margin.
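The reviewer's suggestion could be sketched as below. This is an illustrative outline, not the PR's actual code: `uncertainty_batch_query` and the `_DummyClf` stub are hypothetical names, and the margin helper is a minimal stand-in for modAL's `classifier_margin`.

```python
import numpy as np
from typing import Callable

def classifier_margin(classifier, X, **predict_proba_kwargs):
    """Margin uncertainty: difference between the two largest class probabilities."""
    proba = classifier.predict_proba(X, **predict_proba_kwargs)
    # partition the negated probabilities so the two largest come first
    part = np.partition(-proba, 1, axis=1)
    return -part[:, 0] + part[:, 1]

def uncertainty_batch_query(classifier, X,
                            uncertainty_measure: Callable = classifier_margin,
                            **uncertainty_measure_kwargs):
    # the measure is now a parameter, defaulting to margin as suggested
    uncertainty = uncertainty_measure(classifier, X, **uncertainty_measure_kwargs)
    return uncertainty  # downstream filtering/clustering would follow here

class _DummyClf:
    """Stub classifier returning fixed class probabilities, for illustration."""
    def predict_proba(self, X):
        return np.array([[0.7, 0.2, 0.1], [0.5, 0.4, 0.1]])

margins = uncertainty_batch_query(_DummyClf(), None)
```

Any other measure with the same `(classifier, X, **kwargs)` signature could then be passed in without touching the query function.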
    # Limit data set based on n_instances and filter_param
    record_limit = filter_param * n_instances
    keep_args = np.argsort(uncertainty_scores)[-record_limit:]
argsort is suboptimal in this case because we only need to partition at the record_limit-th instance. argpartition is better suited for that: it is O(n), as opposed to O(n log n) for argsort. You can use multi_argmax or shuffled_argmax, already implemented in selection.py.
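A minimal illustration of the suggestion, using made-up toy scores: `np.argpartition` selects the indices of the top `record_limit` scores without fully sorting.

```python
import numpy as np

uncertainty_scores = np.array([0.1, 0.9, 0.4, 0.7, 0.2, 0.8])
record_limit = 3

# argsort fully sorts the scores: O(n log n)
keep_sorted = np.argsort(uncertainty_scores)[-record_limit:]

# argpartition only guarantees the top record_limit elements end up last: O(n);
# note that the order *within* the kept set is unspecified
keep_part = np.argpartition(uncertainty_scores, -record_limit)[-record_limit:]
```

Both select the same set of indices; since the kept candidates are typically passed on to clustering, their order should not matter.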
This is a PR that implements a new batch active learning query strategy (as mentioned in #119). Diverse Mini-batch Active Learning attempts to take into account both informativeness and diversity when selecting a batch of new examples to be labeled. It's also worth noting that this involves bumping the required scikit-learn version from 0.18 to 0.20. I'm not sure if there's any additional documentation you'd like to have added around this, so just let me know!
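For context, the diverse mini-batch idea can be sketched roughly as follows. This is an illustrative outline rather than the PR's implementation: `diverse_minibatch` and its parameters are hypothetical names, and the real strategy may differ in details (e.g. by weighting the clustering with the informativeness scores).

```python
import numpy as np
from sklearn.cluster import KMeans

def diverse_minibatch(X, uncertainty_scores, n_instances=2, filter_param=3):
    # 1) keep the filter_param * n_instances most informative candidates
    record_limit = filter_param * n_instances
    keep = np.argpartition(uncertainty_scores, -record_limit)[-record_limit:]
    # 2) cluster the candidates to encourage diversity in the batch
    km = KMeans(n_clusters=n_instances, n_init=10, random_state=0).fit(X[keep])
    # 3) pick the candidate nearest each cluster centre
    chosen = []
    for c in range(n_instances):
        members = np.where(km.labels_ == c)[0]
        dists = np.linalg.norm(X[keep][members] - km.cluster_centers_[c], axis=1)
        chosen.append(keep[members[np.argmin(dists)]])
    return np.array(chosen)

# toy data: two well-separated groups of informative points, plus two
# low-score points that the filtering step discards
X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
              [5.0, 5.0], [5.1, 5.0], [5.0, 5.1],
              [2.0, 2.0], [3.0, 3.0]])
scores = np.array([0.9, 0.8, 0.85, 0.9, 0.8, 0.85, 0.1, 0.1])
batch = diverse_minibatch(X, scores)
```

On this toy data the batch contains one point from each group, which is the diversity behaviour the strategy is after.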