BioBridge Dataloader
Loads the BioBridge dataset and prepares it for training and evaluation using LightningDataModule from PyTorch Lightning, with optional caching.
BioBridgeDataModule
Bases: LightningDataModule
LightningDataModule for the BioBridge dataset.
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
__init__(cfg)
Initializes the BioBridgeDataModule.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
cfg | DictConfig | Configuration object with dataset parameters. | required |
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
prepare_data()
Loads and processes the data, using cached data when available. If the cache is invalid or not found, the data is processed from scratch.
Returns:
Type | Description |
---|---|
None | None |
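The cache-or-recompute behaviour described for prepare_data() can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the module's implementation: the names `load_or_process` and `build`, the pickle file format, and the rule that an unreadable file counts as an invalid cache are all hypothetical.

```python
import pickle
import tempfile
from pathlib import Path


def load_or_process(cache_path, build):
    """Return cached data if it can be read, otherwise rebuild and cache it.

    `build` is a hypothetical zero-argument callable that performs the
    expensive processing step.
    """
    cache_path = Path(cache_path)
    if cache_path.exists():
        try:
            with cache_path.open("rb") as fh:
                return pickle.load(fh)
        except (pickle.UnpicklingError, EOFError, AttributeError, ValueError):
            # Assumption: an unreadable file is treated as an invalid cache.
            pass
    data = build()
    cache_path.parent.mkdir(parents=True, exist_ok=True)
    with cache_path.open("wb") as fh:
        pickle.dump(data, fh)
    return data


calls = []


def build():
    # Record each call so the cache hits and misses are visible.
    calls.append(1)
    return {"num_nodes": 3, "edges": [(0, 1), (1, 2)]}


with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "biobridge_cache.pkl"
    first = load_or_process(path, build)    # cache miss: build() runs
    second = load_or_process(path, build)   # cache hit: build() is skipped
    path.write_bytes(b"\x00corrupt")        # simulate an invalid cache file
    third = load_or_process(path, build)    # invalid cache: build() runs again
```

In this sketch the processing function runs only on a cache miss or when the cached file cannot be read, which matches the fallback described above.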
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
setup(stage=None)
Sets up training, validation, and test splits using RandomLinkSplit.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
stage | Optional[str] | Optional stage indicator. | None |
Returns:
Type | Description |
---|---|
None | None |
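To make the idea of a link-level split concrete, the sketch below partitions a list of edges at random into three disjoint sets. It is a simplified plain-Python stand-in, not RandomLinkSplit itself: the function `random_edge_split` is hypothetical, it samples no negative edges, and it returns plain lists rather than graph objects. The `num_val` and `num_test` fractions mirror the argument names of RandomLinkSplit; the values the module actually passes are not stated here.

```python
import random


def random_edge_split(edges, num_val=0.1, num_test=0.2, seed=0):
    """Shuffle edges and cut them into train/val/test lists.

    Simplified stand-in: no negative sampling, no graph objects.
    """
    edges = list(edges)
    rng = random.Random(seed)  # local RNG keeps the split reproducible
    rng.shuffle(edges)
    n_val = int(len(edges) * num_val)
    n_test = int(len(edges) * num_test)
    val = edges[:n_val]
    test = edges[n_val:n_val + n_test]
    train = edges[n_val + n_test:]
    return train, val, test


# Toy graph: a chain of 21 nodes, i.e. 20 edges.
edges = [(i, i + 1) for i in range(20)]
train, val, test = random_edge_split(edges, num_val=0.1, num_test=0.2, seed=42)
print(len(train), len(val), len(test))  # 14 2 4
```

Every edge lands in exactly one split, so validation and test edges are never seen as training targets.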
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
state_dict()
Returns the internal state of the data module.
Returns:
Name | Type | Description |
---|---|---|
dict | Dict[Any, Any] | Empty dictionary (no state to save). |
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
test_dataloader()
Returns the test dataloader.
Returns:
Name | Type | Description |
---|---|---|
GeoDataLoader | DataLoader | DataLoader for test set. |
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
train_dataloader()
Returns the training dataloader.
Returns:
Name | Type | Description |
---|---|---|
GeoDataLoader | DataLoader | DataLoader for training set. |
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
val_dataloader()
Returns the validation dataloader.
Returns:
Name | Type | Description |
---|---|---|
GeoDataLoader | DataLoader | DataLoader for validation set. |
Source code in vpeleaderboard/data/src/kg/biobridge_datamodule_hetero.py
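When a LightningDataModule is driven by hand rather than by a Trainer, its hooks are normally called in the same order the Trainer uses: prepare_data(), then setup(), then the dataloader methods. The sketch below shows that order with a hypothetical stand-in class. `StubDataModule` and the config keys `num_items` and `batch_size` are assumptions made for illustration; the stub holds plain integers instead of a graph and returns lists of batches instead of DataLoader objects.

```python
class StubDataModule:
    """Hypothetical stand-in that mirrors the documented method names."""

    def __init__(self, cfg):
        self.cfg = cfg
        self.data = None
        self.splits = {}

    def prepare_data(self):
        # Stand-in for loading and processing (or reading the cache).
        self.data = list(range(self.cfg["num_items"]))

    def setup(self, stage=None):
        # Stand-in for the train/validation/test split.
        n = len(self.data)
        n_val = n_test = n // 5
        n_train = n - n_val - n_test
        self.splits = {
            "train": self.data[:n_train],
            "val": self.data[n_train:n_train + n_val],
            "test": self.data[n_train + n_val:],
        }

    def _batches(self, name):
        # Chunk one split into fixed-size batches.
        size = self.cfg["batch_size"]
        items = self.splits[name]
        return [items[i:i + size] for i in range(0, len(items), size)]

    def train_dataloader(self):
        return self._batches("train")

    def val_dataloader(self):
        return self._batches("val")

    def test_dataloader(self):
        return self._batches("test")

    def state_dict(self):
        return {}  # no state to save, as documented for the real module


dm = StubDataModule({"num_items": 10, "batch_size": 4})
dm.prepare_data()                      # 1. load and process the data
dm.setup(stage="fit")                  # 2. build the splits
train_batches = dm.train_dataloader()  # 3. request the loaders
val_batches = dm.val_dataloader()
test_batches = dm.test_dataloader()
```

Calling a dataloader method before setup() would fail in this stub because the splits do not exist yet, which is the reason for keeping to the order above.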