Module 01 - The Site Class

Import pathlib

from pathlib import Path

In this module we'll build up a Site Class that will set configuration values and create the root structure of our static site. We'll also create a command line tool using the Typer library. Since we are going to be working with paths let's import pathlib, which is part of the standard library.

Open the site.py located in the ssg directory. At the top import Path from pathlib.

Create a Class

class Site:
    def __init__(self, source, dest):
        self.source = Path(source)
        self.dest = Path(dest)

Below the import you just wrote, create a class called Site. Next, create a Site class constructor that accepts three arguments self, source, and dest.

In the constructor wrap both source, and dest with a call to Path(). Assign the results of these calls to class attributes with the same names using self.

Find Root Directory

def create_dir(self, path):
    directory = self.dest / path.relative_to(self.source)

Still in the Site class create a method called create_dir that accepts two arguments self, and path. In the body of the create_dir method assign a variable called directory a pathlib path that has two parts. It should start with self.dest and end with a path relative_to self.source.

Make a Directory

directory.mkdir(parents=True, exist_ok=True)

On a new line in the create_dir method, call the mkdir method on directory. For our scenario we want directory to be replaced if it exists. Pass the following keyword arguments to mkdir:

parents set to True
exist_ok set to True

Make the Destination Directory

def build(self):
    self.dest.mkdir(parents=True, exist_ok=True)

Create a new method called build in the Site class. Call the mkdir method on self.dest. As with other mkdir calls, pass the following keyword arguments to mkdir:

parents set to True
exist_ok set to True

Recreate all Paths

for path in self.source.rglob("*"):
    if path.is_dir():
        self.create_dir(path)

Still in the build method, create a for loop that iterates through the paths of self.source.rglob(*). Call the current iteration path. In the body of the for loop test if the current path is a directory. If it is a directory call the create_dir method of the class and pass in the current path.

Import Typer

import typer
from ssg.site import Site

Lets setup the command line interface (CLI), open the ssg.py file in the root directory of the project. At the top import typer. Also, import the Site class from ssg.site.

Configuration Options

def main(source="content", dest="dist"):
    config = {
        "source": source,
        "dest": dest
    }

The Typer library requires a function to run that captures command line arguments. We'll call this function main. It should accept two keyword arguments, source with a default value of "content" and dest with a default value of "dist". In the body of the main function create a dictionary called config. Add two key value pairs to config, "source" set to source, and "dest" set to dest.

Build the Site

Site(**config).build()

Still in the main function, create an instance of the Site class. The Site class requires that you provide two attributes source and dest when creating an instance. These are currently store in the config dictionary as key value pairs. Unpack these dictionary values with ** and pass it to the Site instance. Finally, chain a call to the build() method on this instance.

Run Typer

typer.run(main)

At the bottom of the file, typer.run() the main function.

Parser Class

from typing import List

class Parser:
    extensions: List[str] = []

In this module we will create a Parser base class that will have several functions that will help when converting Markdown and ReStructuredText to HTML. To start, open the ssg/parsers.py file. We will add a few type annotations and one requires an import. Import List from typing. Also, import Path from pathlib. Next, create a class called Parser. Create a variable called extensions and assign it an empty list. Annotate extensions with the type List[str].

Validate Extensions

def valid_extension(self, extension):
    return extension in self.extensions

We will need to know whether certain files have a parser. This will be done by looking at the extension. Create a new method in the Parser class called valid_extension. This method should accept an extension and return whether or not that extension is in the class variable self.extensions. Hint: This method is part of the Parser methods so it should accept self as an argument.

Base parse method

def parse(self, path: Path, source: Path, dest: Path):
    raise NotImplementedError

Since the Parser class is a base class we will create a method that will be overwritten in a Sub-class. Call this method parse, it should accept a path, source, and dest. Annotate each of these with the Path type. In the body raise the NotImplementedError.

Parser read method

def read(self, path):
    with open(path, "r") as file:
        return file.read()

The Parser class with need to be able to read the contents of a file. Create a method called read that accepts a path. Use a with statement and a call to open() to open path for reading as file. In the body of the with statement return a what is read() from file.

Parser write method

def write(self, path, dest, content, ext=".html"):
    full_path = dest / path.with_suffix(ext).name

Still in the Parser class create a method called write that accepts the following arguments path, dest, and content. Also, add a keyword argument called ext with a default value of ".html". In the body of the method assign a variable called full_path a pathlib path that has two parts. It should start with dest and end with the name of the path with_suffix() of ext.

Open file for writing

with open(full_path, "w") as file:
    file.write(content)

Still in the write method use with and open() to open path for writing as file. In the body of the with statement write content to file.

Parser copy method

import shutil
def copy(self, path, source, dest):
    shutil.copy2(path, dest / path.relative_to(source))

Move back to the top of the page and import shutil. We'll this use this library to copy resources to the correct location. Below the exiting methods in the Parser class create a new method called copy. This method should accept the following arguments path, source, and dest. Use copy2 from shutil to copy the file at path to the root of dest with a second path relative_to source.

ResourceParser subclass

class ResourceParser(Parser):
    extensions = [".jpg", ".png", ".gif", ".css", ".html"]

    def parse(self, path, source, dest):
        self.copy(path, source, dest)

Create a class called ResourceParser that is a sub-class of Parser. Next, create a Site class constructor that accepts three arguments self, source, and dest. In the constructor wrap both source, and dest with a call to Path(). Assign the results of these calls to class attributes with the same names using self. Create a variable called extensions and assign it a list with 5 extensions, ".jpg", ".png", ".gif", ".css", and ".html".

Implement the parse method in the ResourceParser class. It should have the same signature as in the base class. In the body, call the copy method that is part of the class because we inherited from Parser. Pass in the path, source and dest.

Available Parsers

, parsers=None
self.parsers = parsers or []

Open ssg/site.py and add a keyword argument to the constructor called parsers set to None. In the body of the constructor, set a new class attribute called parsers to parsers or an empty list.

Parser Configuration

import ssg.parsers
"parsers": [
    ssg.parsers.ResourceParser(),
],

Open ssg.py, at the top import ssg.parsers. Add a new key value pair to the config dictionary in the main function. The key should be "parsers" and the value should be a list with a single element of ssg.parsers.ResourceParser().

Site class load parser method

def load_parser(self, extension):
    for parser in self.parsers:
        if parser.valid_extension(extension):
            return parser

Back in ssg/site.py add a new method to the Site class called load_parser below the existing methods. This method should accept an argument called extension. The first statement in the method should be a for loop that cycles through self.parsers. Call the current loop value parser. The body should have an if statement that tests if extension is a valid_extension. Hint: parser is an instance of the correct Parser class. So it will have a method of valid_extension. Return parser in the if statement.

Site class run parser method

def run_parser(self, path):
    parser = self.load_parser(path.suffix)

Still in the Site class, add a new method called run_parser. This method should accept an argument called path. In the method call load_parser passing in path.suffix, save the result to a variable called parser.

Call the parse method

if parser is not None:
    parser.parse(path, self.source, self.dest)
else:
    print('Not Implemented')

Still in the run_parser method test if parser is not None. If parser is not None then call the parse method of parser. Pass in the path and the Site class attributes of source and dest. In an else print the message Not Implemented.

Run the parser

elif path.is_file():
    self.run_parser(path)

To connect everything together find the if statement in the build method. Add an elif that tests whether path is a file. If path is a file then call run_parser passing in path. Hint: run_parser is part of the Site class.

Imports

import re

from collections.abc import Mapping

from yaml import load, FullLoader

In this module we will extract YAML frontmatter from files in our site. We this will be done by spliting the contents of the file. This will require the use of regular expressions. Import re from the standard library. We will also need Mapping from collections.abc. Finally, import load and FullLoader from yaml.

Content class

class Content(Mapping):
    __delimiter = r"^(?:-|\+){3}\s*$"
    __regex = re.compile(__delimiter, re.MULTILINE)

Below the imports create a class called Content, make sure that it inherits from Mapping. In the new class create a class variable called __delimeter, assign it the raw string "^(?:-|+){3}\s*$". Create another class variable called __regex, assign it the result of a call to re.compile(), pass in __delimiter and the constant re.MULTILINE.

The load class method

@classmethod
def load(cls, string):
    _, fm, content = cls.__regex.split(string, 2)
    metadata = load(fm, Loader=FullLoader)

    return cls(metadata, content)

Create a new method called load in the Content class, make it a class method with the appropriate decorator. This method should accept two arguments cls and string. In the body of the load method call split on the __regex class variable passing in string and a depth of 2. Assign the result of this to 3 variables _, fm, content. Next, on a new line call load and pass in fm and a keyword argument of Loader set to FullLoader. Finally, return a call to cls() and pass in metadata and content.

Content constructor

def __init__(self, metadata, content):
    self.data = metadata
    self.data["content"] = Content

Create a Content class constructor below the load method. The constructor should accept two arguments, metadata and content. Create a class attribute named data and assign it metadata. On a new line, add a key value pair to self.data of "content" set to content.

Body property

@property
def body(self):
    return self.data["content"]

Still in the Content class add a class @property of body that returns self.data["content"].

Type property

@property
def type(self):
    return self.data["type"] if "type" in self.data else None

Also in the Content class add a class @property of type that returns self.data["type"] if self.data has a key of type. If that key doesn't exist then return None. This needs to be done with a ternary if.

Type setter

@type.setter
def type(self, type):
    self.data["type"] = type

Create a setter for the type @property that assigns self.data["type"].

Custom getitem method

def __getitem__(self, key):
    return self.data[key]

Let's overwrite the default __getitem__ method so that it returns the value from self.data[] for the key that is provided.

Custom iterator method

def __iter__(self):
    self.data.__iter__()

The Content class will also have a custom __iter__ method that calls self.datas iterator method.

Custom length method

def __len__(self):
    return len(self.data)

We'll also need a custom __len__ method that returns the length of self.data.

Content Representation

def __repr__(self):
    data = {}
    return str(data)

The final method that we will overwrite is the __repr__ method. It will create a custom representation of self.data. Create a __repr__ method and on the first line create an empty dictionary called data. Return a call to str() passing in data.

Removing content from the Representation

for key, value in self.data.items():
    if key != "content":
        data[key] = value

We would only like the custom representation to include certain values. Loop through self.data.items() with a for loop. The current key should be called key and the value should be value. In the for loop test if key is not equal to "content". In the if assign the value to data[key].

Markdown and ReStructuredText Imports

from docutils.core import publish_parts
from markdown import markdown

from ssg.content import Content

In this module we will convert Markdown and ReStructuredText to HTML. Import the following:

publish_parts from docutils.core
markdown from markdown
Content from ssg.content

Markdown Parser

class MarkdownParser(Parser):
    extensions = [".md", ".markdown"]

We have already created a Parser sub-class let's create another sub-class called MarkdownParser. With in the new MarkdownParser class create a variable called extensions. This should be assigned a list with the extensions .md and .markdown.

Markdown parse method

def parse(self, path, source, dest):
    content = Content.load(self.read(path))

Implement the parse method in the MarkdownParser class. It should have the same signature as in the base class. In the body, call the Content.load class method and pass in a call to self.read() and to that pass path. Assign the result to a variable called content.

Converting markdown to html

html = markdown(content.body)
self.write(path, dest, html)

In the body of the parse method call the markdown method and pass in content.body. Assign the results to a variable called html. Use self.write() to write html to path at dest.

ReStructuredText Parser

class ReStructuredTextParser(Parser):
    extensions = [".rst"]

Create another sub-class called ReStructuredTextParser. With-in the new ReStructuredTextParser class create a variable called extensions. This should be assigned a list with the extension .rst.

ReStructuredText parse method

def parse(self, path, source, dest):
    content = Content.load(self.read(path))

Implement the parse method in the ReStructuredTextParser class. It should have the same signature as in the base class. In the body, call the Content.load class method and pass in a call to self.read() and to that pass path. Assign the result to a varaible called content.

Converting ReStructuredText to html

html = publish_parts(content.body, writer_name="html5")
self.write(path, dest, html["html_body"])

In the body of the parse method call the publish_parts method and pass in content.body, also, add a keyword argument of writer_name set to "html5". Assign the results to a variable called html. Use self.write() to write the html["html_body"] to path at dest.

Error reporting static method

import sys

@staticmethod
def error(message, end="\n"):
    sys.stderr.write("\x1b[1;31m" + message.strip() + "\x1b[0m" + end)

Switch over to ssg/site.py and at the top import sys.

Then, below all other methods in the Site class create a static method with the @staticmethod decorator called error. Since this is a static method it does not need to accept self but it does need two accept two arguments message and end with a default value of "\n".

In the body of the error method call thesys.stderr.write() method. Pass in the string "\x1b[1;31m" concatenated with message.strip(), "\x1b[0m" and finally end.

Calling the error static method

self.error(
    "No parser for the `{}` extension, file skipped!".format(
        path.suffix
    )
)

Find the run_parser method in the Site class and replace the print() call with a call to self.error(). Then, pass in the message "No parser for the {} extension, file skipped!". Append to this string a call to format() passing in path.suffix.

Files

tasks.md

Latest commit

History