Project Gutenberg is a valuable resource for accessing a vast collection of public domain texts. However, the texts on Project Gutenberg often contain unnecessary boilerplate, making it difficult for users to extract the actual content. While there are existing tools like Gutenizer aimed at addressing this issue, they often fall short in fully removing the …