Viewing a single comment thread. View all comments

Bobertus t1_iya8rw6 wrote

How does this relate to spaced repetition?

1

nszceta OP t1_iyaaq9l wrote

Spaced repetition approaches model ngram learning rate and forgetting rate over time. The multi armed bandit method does not make any assumptions about performance over time. Rather, it is an attempt to discover your current abilities. The dynamic component of thompson sampling is implemented to ensure recent changes in per-ngram performance are weighted more heavily than older data points in the underlying statistical model. Thus what is being modeled is a best effort estimate of your abilities given limited exploration time.

1

Bobertus t1_iydl57u wrote

I've heard about the concept of search and exploit/multi armed bandit. But only superficially, and I never heard of Thompson sampling.

I'm wondering if you could make the "dynamic component" model the forgetting curve such that data points that probably aren't forgotten yet are weighted higher, giving you a new kind of spaced repetition algorithm.

I've been thinking of experimenting with creating spaced repetition software and I had the idea for a typing trainer using some kind of spaced repetition algorithm.

1

nszceta OP t1_iydqz0t wrote

Spaced repetition algorithms are a minefield. I have developed and partially validated an unpublished (so far) SR algo. The biggest failure of existing SR algorithms and something which has been studied in literature since 2016 a phenomemon in the human brain where retrieval performance of an item you want to learn becomes easier and harder seemingly at random. It turns out that there are three distinct phases of memory and your brain switches between them over time as it consolidates things you learn. A regression of performance over time becomes complicated. There are ways to deal with this using regime switching markov decision processes, but it is incredibly easy to get it all wrong. Another essential component of spaced repetitions so many algorithms get wrong is ignoring the spacing effect.

1