gammagl.datasets.WebKB

class WebKB(root=None, name='cornell', transform=None, pre_transform=None, force_reload: bool = False)[source]

The WebKB datasets used in the “Geom-GCN: Geometric Graph Convolutional Networks” paper. Nodes represent web pages and edges represent hyperlinks between them. Node features are the bag-of-words representation of web pages. The task is to classify the nodes into one of the five categories, student, project, course, staff, and faculty.

Parameters:
  • root (str, optional) – Root directory where the dataset should be saved.

  • name (str) – The name of the dataset. ("Cornell", "Texas", "Wisconsin")

  • transform (callable, optional) – A function/transform that takes in an gammagl.data.Graph object and returns a transformed version. The data object will be transformed before every access. (default: None)

  • pre_transform (callable, optional) – A function/transform that takes in an gammagl.data.Graph object and returns a transformed version. The data object will be transformed before being saved to disk. (default: None)

  • (bool (force_reload) – (default: False)

  • optional) (Whether to re-process the dataset.) – (default: False)

url = 'https://raw.githubusercontent.com/graphdml-uiuc-jlu/geom-gcn/master'
property raw_dir
property processed_dir
property raw_file_names

The name of the files in the self.raw_dir folder that must be present in order to skip downloading.

property processed_file_names

The name of the files in the self.processed_dir folder that must be present in order to skip processing.

download()[source]

Downloads the dataset to the self.raw_dir folder.

process()[source]

Processes the dataset to the self.processed_dir folder.