AnnA : Anki neuronal Appendix

Tired of having to deal with anki flashcards that are too similar when grinding through your backlog? This python script creates filtered deck in optimal review order. It uses Machine Learning / AI to make semantically linked cards far from one another.

One sentence summary
Note to readers
Other features
FAQ
Getting started
Usage and arguments
TODO list
Credits and links that were helpful
Crazy Ideas

One sentence summaries

Here are different ways of looking at what AnnA can do for you in a few words:

When you don't have the time to complete all your daily reviews, use this to create a special filtered deck that makes sure you will only review the cards that are most different from the rest of your reviews.
When you have too many learning cards and fear that some of them are too similar, use this to automatically review a subset of them.
AnnA helps to avoid reviewing similar cards on the same day.
AnnA allows to reduce the number of daily reviews while increasing (and not keeping the same) retention.

Note to readers

The dev branch is usually less outdated than the main branch.
I would really like to integrate this into anki somehow but am not knowledgeable enough about how to do it, how to manage anki versions, how to handle different platforms etc. All help is much appreciated!
This project communicates with anki using a fork of the addon AnkiConnect called AnnA-companion. Note that AnnA-companion was tested from anki 2.1.44 to 2.1.54 only.
Although I've been using it daily for months, I am still changing the code base almost every day, if you tried AnnA and were disappointed, maybe try it another time later. Major improvements are regularly made.
In the past I implemented several vectorization methods. I now only kept subword TF_IDF. TF_IDF is known to be reliable, fast, very general (it does not assume anything about your cards and will work for just about any language, format, phrasing etc). TF_IDF works very well if you have large number of cards.
If you want to know how I'm using this, take a look at authors_routine.md
If you like this, another project of mine can be used to do semantic search on your anki collection using AI : I call it Anki SemSearch

Other features

Code is PEP8 compliant, dynamically typed, all the functions have a detailed docstrings. Contributions are welcome, opening issues is encouraged and appreciated.
Keeps the OCR data of pictures in your cards, if you analyzed them beforehand using AnkiOCR.
Can automatically replace acronyms in your cards (e.g. 'PNO' can be replaced to "pneumothorax" if you tell it to), regexp are supported (if you want access to my homemade list of french medical acronyms, open an issue)
Can attribute more importance to some fields of some cards if needed.
Can be used to optimize the order of a deck of new cards, read this thread to know more
Previous feature like clustering, plotting, searching have been removed as I don't expect them to be useful to users. But the code was clean so if you need it for some reason don't hesitate to open an issue.

FAQ

This looks awesome! Are you working on other things like that? Yes and thank you! Another project of mine aims to do semantic search on your anki collection using AI : I call it Anki SemSearch
How does it work? (Layman version) AnnA connects to its companion addon to access your anki collection. This allows to use any python library without having to go through the trouble how packaging those libraries into an addon. It uses a vectorizer to assign numbers (vectors) to each cards. If the numbers of two cards are very close, then the cards have similar content and should not be reviewed too close to each other.
And in more details? The vectorization method AnnA is using, subword TF_IDF is a way to count words in a document (anki cards in this case) and understand which are more important. "subword" here means that I used BERT tokenization (i.e. splitting "hypernatremia" into "hyper na tremia" which can be linked to cards dealing with "Na Cl" for example). TF_IDF is generally considered as part of machine learning. AnnA leverages this information to make sure you won't review cards on the same day that are too similar. This is very useful when you have to many cards to review in a day. I initially tried to combine several vectorizer into a common scoring but it proved unreliable, I also experimented with fastText (only one language at a time, can't be packaged and large RAM usage), sentence-BERT (too large and depends too much on phrasing to be reliable). So I decided to keep it simple and provide only TFIDF. The goal is to review only the most useful cards, kinda like pareto distribution (i.e. review less cards, but review the right one and you should be able to keep afloat in med school). The code is mainly composed of a python class called AnnA. When you instantiate this class, you have to supply the name of the deck you want to filter from. It will then automatically fetch the cards from your collection, then use TF_IDF to assign vectors to each card, compute the distance matrix of the cards and create a filtered deck containing the cards in the optimal order (or bury the cards you don't have to review today). Note that rated cards of the last X days of the same deck will be used as reference to avoid having cards that are too similar to yesterday's reviews too. If you want to know more, either open an issue or read the docstrings in the code.
Will this change anything to my anki collection? It should not modify anything, if you delete the filtered deck, everything will be back to normal. That being said, the project is young and errors might still be present.
Does it work if I have learning steps over multiple days? Yes, that's my use case. AnnA, depending on the chosen task, either only deals with review queue and ignores learning cards and new cards, or only buries the part of your learning cards that are too similar (ignoring due cards). You can use both one after the other every morning like I do. If you have learning cards in your filtered deck it's because you lapsed those cards yesterday.
Does this only work for English cards? No! TF_IDF use a multilingual BERT uncased tokenization so should work on most languages (even if you have several different languages in the same deck).
Can I use this if I don't know python? Yes! Installing the thing might not be easy but it's absolutely doable. And you don't need to know python to run AnnA. I tried my best to make it accessible and help is welcome.
What do you call "optimal review order"? The order that minimizes the chance of reviewing similar cards in a day. You see, Anki has no knowledge of the content of cards and only cares about their interval and ease. Its built-in system of "siblings" is useful but I think we can do better. AnnA was made to create filtered decks sorted by "relative_overdueness" (or other) BUT in a way that keeps semantic siblings far from each other.
When should I use this? It seems good for dealing with the huge backlog you get in medical school, or just everyday to reduce the workload. If you have 2 000 reviews to do, but can only do 500 in a day: AnnA is making sure that you will get the most out of those 500.
How do you use this? I described my routine in a separate file called authors_routine.md.
Can I use AnnA to optimize a list of new cards? I never did it personally but helped a user doing it: see related thread
What are the power requirements to run this? I wanted to make it as efficient as possible but am still improving the code. Computing the distance matrix can be long if you do this on very large amount of cards but this step is done in parallel on all cores so should not be the bottleneck. Let me know if some steps are unusually slow and I will try to optimize it. There are ways to make it way easier on the CPU, see arguments low_power_mode and TFIDF_dim.
How long does it take to run? Is it fast? It takes about 1 min to run for a deck of less than 500 due cards, I've seen it take as long as 4 minutes on a 10 000 due cards deck. This is on a powerful laptop with no GPU on Linux.
Why is creating the queue taking longer and longer? Each card is added to the queue after having been compared to the rated cards of the last few days and the current queue. As the queue grows, more computation have to be done. If this is an issue, consider creating decks that are as big as you think you can review in a day. With recent improvements in the code the speed should really not be an issue.
Does this work only on Linux? It should work on all platform, provided you have anki installed and AnnA-companion enabled. But the script (not the addon) uses libraries that might only work on some CPU architectures, so I'm guessing ARM system would not work but please tell me if you know tried.
What is the current status of this project? I use it daily but am still working on the code. You can expect breaking. I intend to keep developing until I have no free time left. Take a look at the TODO list of the dev branch if you want to know what's in the works. When in doubt, open an issue.
Do you mind if I open an issue? Not at all! It is very much encouraged, even just for a typo. That will at least make me happy. Of course PR are always welcome too.
Can this be made into an anki addon instead of a python script? I have never packaged things into anki addons so I'm not so sure. I heard that packaging complex modules into anki is a pain, and cross platform will be an issue. If you'd like to make this a reality, show yourself by opening an issue! I would really like this to be implemented into anki, and the search function would be pretty nice :)
What version of anki does this run on? I've been using AnnA from anki 2.1.44 and am currently on 2.1.54 Compatibility relies heavily on anki-connect. Please tell me if you run into issues.
If I create a filtered deck using AnnA, how can I rebuild it? You can't rebuilt it or empty it through anki directly as it would leave you with anki's order and not AnnA's. You have to delete the filtered deck then run the script. Hence, I suggest creating large filtered decks in advance.
I don't think the reviews I do on AnnA's filtered decks are saved, wtf? It might be because you're using multiple device and are deleting the filtered deck on one of the device without syncing first.
What is subword TF_IDF? Short for "subword term frequency–inverse document frequency". It's a clever way to split words into subparts then count the parts to figure out which cards are related.
Does it work with images? Not currently but sBERT can be used with CLIP models so I could pretty simply implement this. If you think you would find it useful I can implement it :).
What are the supported languages using TF_IDF? TF_IDF is language agnostic, but the language model used to split the words was trained on the 102 largest wikipedia corpus. If your languages is very weird and non unicode or non standard in some ways you might have issues, don't hesitate to open an issue as I would gladly take a look.
What is the field_mapping.py file? It's a file with a unique python dictionary used by AnnA to figure out which field of which card to keep. Using it is optional. By default, each notetype will see only it's first field kept. But if you use this file you can keep multiple fields. Due to how TF_IDF works, you can add a field multiple times to give it more importance relative to other fields.
What is "XXX - AnnA Optideck"? The default name for the filtered deck created by AnnA. It contains the reviews in the best order for you.
Why are there only reviews and no learning cards in the filtered decks? When task is set to filter_review_cards, AnnA will fetch only review cards and not deal with learning cards. This is because I'm afraid of some weird behavior that would arise if I change the due order of learning cards. Whereas I can change it just find using review cards.
Why does the progress bar of "Computing optimal review order" doesn't always start at 0? It's just cosmetic. At each step of the loop, the next card to review is computed by comparing it against the previously added cards and the cards rated in the last few days. This means that each turn is slower than the last. So the progress bar begins with the number of cards rated in the past to indicate that. It took great care to optimize the loop so it should not really be an issue.
Does AnnA take into account my card's tags ? If you set the argument append_tags to True, the 2 deepest level of each tags are appended at the end of the text and used like if it was part of the content of the card. Note that "_ - and /" are replaced by a space. For example : a::b::c::d some::thing will be turned into c d some thing.
Why does the task "bury_excess_learning_cards" seem to ignore a few cards? I decided to have this task not take into account cards that were failed today or the day before, those are usually top priority and I don't won't AnnA to bury them.
How can I know AnnA is actually working and not burying random cards? Good question, I tried to implement a metric called Improvement ratio that displays a score at the end of the run. It's a sort of sum of the distance between all cards in the new queue over the sum of the distance between all cards in what would have been the queue. It's not really indicative of anything beyond the fact that it's over 1 (=it helped) or under 1 (=it was detrimental). But the latter is kinda hard to get right because it depends on the reliability of reference_order in itself. I am especially doubtful of the meaning of the ratio when dealing with learning cards but so far it seems to work ok. Consider it a work in progress.
I have underlined or put in bold the most important parts of my cards, does it matter? Words that are put in bold or underlined are actually duplicated in the text, this gives them twice as much importance.
What scheduler should I use with AnnA? Can I use the V3 scheduler? I strongly suggest leaving the V1 as it messes with learning cards quite a lot. I personally use V2 and developed AnnA while using V2. I don't fully understand the changes in V3 and am not really certain that it works correctly with AnnA so I can't really say. I plan to investigate switching to V3 around October 2022.

Getting started

First, read this page in its entirety, this is a complicated piece of software and you don't want to use it irresponsibly on your cards. The usage section is especially useful.
Install the addon AnnA Companion (Anki neuronal Appendix) - do LESS reviews with MORE retention!
Clone this repository (for example with git clone https://github.com/thiswillbeyourgithub/AnnA_Anki_neuronal_Appendix)
Install the required python libraries : pip install -r requirements.txt (in case of errors, try using python 3.9)
Edit file field_mapping.py: it contains a dictionary where the keys are notetypes and values are lists of which field to take into account.
Edit file acronym_file.py: it contains dictionaries where keys are words to replace and values are what the words should be replaced with.
There are two ways to run AnnA:
- Either in a Python console : from AnnA import * ; AnnA(YOUR_ARGUMENTS)
- Or directly in the terminal : python3 AnnA.py --help (note that the terminal mode was added after the python console and might still contain error when parsing arguments)
If you want to run AnnA on several decks in a row like I do, edit the file autorun.py then execute it with python3 ./autorun.py
Open an issue telling me your remarks and suggestion

Usage and arguments

AnnA was made with customizability in mind. All the settings you might want to edit are arguments of the call of AnnA Class. Don't be frightened, many of those settings are rarely used and the default values should be good for almost anyone. Here are the arguments with the relevant explanation.

python3 ./AnnA.py --help:

usage: AnnA.py [-h] --deckname DECKNAME [--reference_order REF_ORDER] --task TASK
               --target_deck_size TARGET_SIZE [--stopwords_lang STOPLANG [STOPLANG ...]]
               [--rated_last_X_days RATED_LAST_X_DAYS]
               [--score_adjustment_factor SCORE_ADJUSTMENT_FACTOR]
               [--field_mapping FIELD_MAPPING_PATH] [--acronym_file ACRONYM_FILE_PATH]
               [--acronym_list [ACRONYM_LIST ...]] [--minimum_due MINIMUM_DUE_CARDS]
               [--highjack_due_query HIGHJACK_DUE_QUERY]
               [--highjack_rated_query HIGHJACK_RATED_QUERY] [--low_power_mode]
               [--log_level LOG_LEVEL] [--replace_greek] [--keep_OCR] [--append_tags]
               [--tags_to_ignore [TAGS_TO_IGNORE ...]] [--tags_separator TAGS_SEP]
               [--fdeckname_template FILTER_DECK_NAME_TEMPLATE] [--show_banner]
               [--skip_print_similar] [--vectorizer VECTORIZER]
               [--TFIDF_dim TFIDF_DIMENSIONS] [--TFIDF_tokenize] [--TFIDF_stem]
               [--whole_deck_computation] [--profile_name PROFILE_NAME]
               [--keep_console_open]

optional arguments:
  -h, --help            show this help message and exit
  --deckname DECKNAME   the deck containing the cards you want to review. If you don't
                        supply this value or make a mistake, AnnA will ask you to type in
                        the deckname, with autocompletion enabled (use `<TAB>`). Default is
                        `None`.
  --reference_order REF_ORDER
                        either "relative_overdueness" or "lowest_interval". It is the
                        reference used to sort the card before adjusting them using the
                        similarity scores. Default is `"relative_overdueness"`. Keep in mind
                        that my relative_overdueness is a reimplementation of the default
                        overdueness of anki and is not absolutely exactly the same but
                        should be a close approximation. If you find edge cases or have any
                        idea, please open an issue.
  --task TASK           can be "filter_review_cards", "bury_excess_learning_cards",
                        "bury_excess_review_cards". Respectively to create a filtered deck
                        with the cards, or bury only the similar learning cards (among other
                        learning cards), or bury only the similar cards in review (among
                        other review cards). Default is "`filter_review_cards`".
  --target_deck_size TARGET_SIZE
                        indicates the size of the filtered deck to create. Can be the number
                        of due cards like "100", a proportion of due cards like '80%' or
                        '0.80', the word "all" or "deck_config" to use the deck's settings
                        for max review. Default is `deck_config`.
  --stopwords_lang STOPLANG [STOPLANG ...]
                        a list of languages used to construct a list of stop words (i.e.
                        words that will be ignored, like "I" or "be" in English). Default is
                        `english french`.
  --rated_last_X_days RATED_LAST_X_DAYS
                        indicates the number of passed days to take into account when
                        fetching past anki sessions. If you rated 500 cards yesterday, then
                        you don't want your today cards to be too close to what you viewed
                        yesterday, so AnnA will find the 500 cards you reviewed yesterday,
                        and all the cards you rated before that, up to the number of days in
                        rated_last_X_days value. Default is `4` (meaning rated today, and in
                        the 3 days before today). A value of 0 or `None` will disable
                        fetching those cards. A value of 1 will only fetch cards that were
                        rated today. Not that this will include cards rated in the last X
                        days, no matter if they are reviews or learnings. you can change
                        this using "highjack_rated_query" argument.
  --score_adjustment_factor SCORE_ADJUSTMENT_FACTOR
                        a tuple used to adjust the value of the reference order compared to
                        how similar the cards are. Default is `(1, 5)`. For example: (1,
                        1.3) means that the algorithm will spread the similar cards farther
                        apart.
  --field_mapping FIELD_MAPPING_PATH
                        path of file that indicates which field to keep from which note type
                        and in which order. Default value is `field_mappings.py`. If empty
                        or if no matching notetype was found, AnnA will only take into
                        account the first 2 fields. If you assign a notetype to
                        `["take_all_fields]`, AnnA will grab all fields of the notetype in
                        the same order as they appear in Anki's interface.
  --acronym_file ACRONYM_FILE_PATH
                        a python file containing dictionaries that themselves contain
                        acronyms to extend in the text of cards. For example `CRC` can be
                        extended to `CRC (colorectal cancer)`. (The parenthesis are
                        automatically added.) Default is `"acronym_file.py"`. The matching
                        is case sensitive only if the key contains uppercase characters. The
                        ".py" file extension is not mandatory.
  --acronym_list [ACRONYM_LIST ...]
                        a list of name of dictionaries to extract file supplied in
                        `acronym_file`. Used to extend text, for instance
                        `["AI_machine_learning", "medical_terms"]`. Default to None.
  --minimum_due MINIMUM_DUE_CARDS
                        stops AnnA if the number of due cards is inferior to this value.
                        Default is `5`.
  --highjack_due_query HIGHJACK_DUE_QUERY
                        bypasses the browser query used to find the list of due cards. You
                        can set it for example to `deck:"my_deck" is:due -rated:14 flag:1`.
                        Default is `None`. **Keep in mind that, when highjacking queries,
                        you have to specify the deck otherwise AnnA will compare your whole
                        collection.**
  --highjack_rated_query HIGHJACK_RATED_QUERY
                        same idea as above, bypasses the query used to fetch rated cards in
                        anki. Related to `highjack_due_query` although you can set only one
                        of them. Default is `None`.
  --low_power_mode      enable to reduce the computation needed for AnnA, making it usable
                        for less powerful computers. Default to `False`. In more details, it
                        mainly reduces the argument `ngram_range` for TFIDF, making it use
                        unigrams instead of n-grams with n from 1 to 3. It also skips trying
                        to find acronyms that were not replaced as well as identifying
                        similar cards.
  --log_level LOG_LEVEL
                        can be any number between 0 and 2. Default is `2` to only print
                        errors. 1 means print also useful information and >=2 means print
                        everything. Messages are color coded so it might be better to leave
                        it at 3 and just focus on colors.
  --replace_greek       if True, all greek letters will be replaced with a spelled version.
                        For example `σ` becomes `sigma`. Default is `True`.
  --keep_OCR            if True, the OCR text extracted using the great AnkiOCR addon
                        (https://github.com/cfculhane/AnkiOCR/) will be included in the
                        card. Default is `True`.
  --append_tags         Wether to append the 2 deepest tags to the cards content or to add
                        no tags.
  --tags_to_ignore [TAGS_TO_IGNORE ...]
                        a list of tags to ignore when appending tags to cards. Default is
                        `None`, to ignore.
  --tags_separator TAGS_SEP
                        separator between levels of tags. Default to `::`.
  --fdeckname_template FILTER_DECK_NAME_TEMPLATE
                        name template of the filtered deck to create. Only available if task
                        is set to "filter_review_cards". Default is `None`.
  --show_banner         used to display a nice banner when instantiating the collection.
                        Default is `True`.
  --skip_print_similar  default to `False`. Skip printing example of cards that are very
                        similar or very different. This speeds up execution but can help
                        figure out when something when wrong.
  --vectorizer VECTORIZER
                        can nowadays only be set to "TFIDF", but kept for legacy reasons.
  --TFIDF_dim TFIDF_DIMENSIONS
                        the number of dimension to keep using SVD Default is `100`, you
                        cannot disable dimension reduction for TF_IDF because that would
                        result in a sparse matrix.
                        https://scikit-learn.org/stable/modules/generated/sklearn.decomposit
                        ion.TruncatedSVD.html).
  --TFIDF_tokenize      default to `True`. Enable sub word tokenization, for example turn
                        `hypernatremia` to `hyp + er + natr + emia`. The current tokenizer
                        is `bert-base-multilingual-cased` and should work on just about any
                        languages. You cannot enable both `TFIDF_tokenize` and `TFIDF_stem`
                        but should absolutely enable at least one.
  --TFIDF_stem          default to `False`. Wether to enable stemming of words. Currently
                        the PorterStemmer is used, and was made for English but can still be
                        useful for some other languages. Keep in mind that this is the
                        longest step when formatting text.
  --whole_deck_computation
                        defaults to `True`. Use ankipandas to extract all text from the deck
                        to feed into the vectorizer. Results in more accurate relative
                        distances between cards. (more information at
                        https://github.com/klieret/AnkiPandas)
  --profile_name PROFILE_NAME
                        defaults to `None`. Profile named used by ankipandas to find your
                        collection. If None, ankipandas will use the most probable
                        collection.
  --keep_console_open   defaults to `False`. Set to True to open a python console after
                        running.

AnnA includes built-in methods you can run after instantiating the class. Note that methods beginning with a "_" are not supposed to be called by the user and are reserved for backend use. Here's a list of sometimes useful methods:

note: to call python method after running anna outside of a console, use argument --keep_console_open and access the variable anna. For example anna.display_best_review_order()

display_best_review_order used as a debugging tool : only display order. Allows to check if the order seems correct without having to create a filtered deck.
save_df saves the dataframe containing the cards and all other infos needed by AnnA as a pickle file. Used mainly for debugging. Files will be saved to the folder DF_backups

TODO

see dev branch

Credits and links that were helpful

Corentin Sautier for his many many insights and suggestions on ML and python. He was also instrumental in devising the score formula used to order the filtered deck.
A post about class based tf idf by Maarten Grootendorst on Towardsdatascience
The authors of sentence-bert and their very informative website
The author of the addon anki-connect, as this project was indispensable when developing this addon. The companion addon is a reduced fork from anki-connect.

Crazy ideas

The following is kept as legacy but was made while working on the ancestor of AnnA, don't judge please.

Disclaimer : I'm a medical student extremely interested in AI but who has trouble devoting time to this passion. This project is a way to get to know machine learning tools but can have practical applications. I like to have crazy ideas to motivate my projects and they are listed belows. Don't laugh. Don't hesitate to contribute either.

Scheduling Using AnnA to do more different reviews might, somehow, increase the likelihood of eureka moments where your brain just created a new neural paths. That would supposedly help to remember and master a field.
Optimize learning via cues : A weird idea of mine is about the notion that hearing a tone when you're learning something will increase your recall if you have the same tone playing while taking the test. So maybe it would be useful to assign a tone that increases in pitch while you advance in the queue of due cards? I told you it was crazy ideas... Also this tone could play the tone assigned to the cluster when doing random reviews.
Mental Walk (credit goes to a discussion with Paul Bricman) : integrating AnnA with VR by offering the user to walk in an imaginary and procedurally generated world with the reviews materialized in precise location. So the user would have to walk to the flashcard location then do the review. The cards would be automatically grouped by cluster into rooms or forests or buildings etc. Allowing to not have to create the mental palace but just have to create the flashcards.
- a possibility would be to do a dimension reduction to 5 dimensions. Use the 2 first to get the spatial location were the cluster would be assigned. Then the cards would be spread across the room but the 3 remaining vectors could be used to derive something about the room like wall whiteness, floor whiteness and the tempo of a music playing in the background.
- we could ensure that the clusters would always stay in the same room even after adding many cards or even across users by querying a large language model for the vectors associated to the main descriptors of the cluster.
Source Walking : it would be interesting to do the anki reviews in VR where the floor would be your source (pdf, epub, ppt, whatever). The cards would be spread over the document, approximately located above the actual source sentence. Hence leveraging the power of mental palace while doing your reviews. Accessing the big picture AND the small picture.

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

thiswillbeyourgithub / AnnA_Anki_neuronal_Appendix

Programming Languages

Labels

Projects that are alternatives of or similar to AnnA Anki neuronal Appendix