zoology

Understand and test language model architectures on synthetic tasks.

Other · Emerging

GitHub

Contributors

8

Last push

9d ago

Recent commits

add a simple stacked version of mqar that can be solved by a 1 layer sequence mixer (no short convs needed for the shift); add a continuous model (i.e., that operates in embedding space rather than discrete tokens); add more loss functions for training (dot-product ce, mse)

d88d6f5 · Simran Arora · 5mo ago

clean up pass

dbcb60d · Simran Arora · 5mo ago

nit

a260157 · Simran Arora · 5mo ago