CHI Note 2017 - Fanfiction Metadata Download
img "; img "; img "; img "; img "; img ";

With its roots dating to popular television shows of the 1960s such as Star Trek, fanfiction has blossomed into an extremely widespread form of creative expression. The transition from printed zines to online fanfiction repositories has facilitated this growth in popularity, with millions of fans writing stories and adding daily to sites such as Archive Of Our Own,,, and many others. Enthusiasts are sharing their writing, reading stories written by others, and helping each other to grow as writers. Yet, this domain is often undervalued by society and understudied by researchers. To facilitate the study of this large but often marginalized community, we present a fully anonymized data release (via differential privacy) of the metadata from a large fanfiction site (to protect author privacy, story, profile, and review text is excluded, and only metadata is provided). We use visual analytics techniques to draw several intriguing insights from the data and show the potential for future research. We hope other researchers can use this data to explore further questions related to online fanfiction communities.

Cite Our Work

Please cite our CHI 2017 Note if you use our database in your work.

"Where No One Has Gone Before: A Meta-Dataset of the World's Largest Fanfiction Repository," Kodlee Yin, Cecilia Aragon, Sarah Evans, Katie Davis. CHI 2017: ACM Conference on Human Factors in Computing Systems. (2017).
What We Provide

All data entries are anonymized and will satisfy differential privacy constraints so that information cannot be traced back and associated to the individual that produced it. The information we're releasing contains only the metadata of the stories published on a large fanfiction repository. The data contains the following columns, given in order from left to right.

csv (649 MB) xml (723 MB)