gammagl.datasets.IMDB¶
- class IMDB(root: str | None = None, transform: Callable | None = None, pre_transform: Callable | None = None, force_reload: bool = False)[source]¶
A subset of the Internet Movie Database (IMDB), as collected in the “MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding” paper. IMDB is a heterogeneous graph containing three types of entities - movies (4,278 nodes), actors (5,257 nodes), and directors (2,081 nodes). The movies are divided into three classes (action, comedy, drama) according to their genre. Movie features correspond to elements of a bag-of-words representation of its plot keywords.
- Parameters:
root (str, optional) – Root directory where the dataset should be saved.
transform (callable, optional) – A function/transform that takes in an
gammagl.data.HeteroGraph
object and returns a transformed version. The data object will be transformed before every access. (default:None
)pre_transform (callable, optional) – A function/transform that takes in an
gammagl.data.HeteroGraph
object and returns a transformed version. The data object will be transformed before being saved to disk. (default:None
)(bool (force_reload) – (default:
False
)optional) (Whether to re-process the dataset.) – (default:
False
)
- url = 'https://www.dropbox.com/s/g0btk9ctr1es39x/IMDB_processed.zip?dl=1'¶
- property raw_file_names: List[str]¶
The name of the files in the
self.raw_dir
folder that must be present in order to skip downloading.