- Create a script titled
import_<dataset name>.py. Choose a dataset name that is concise, yet descriptive. - Convert the raw data into an
xgihypergraph, using the above script. Examples of importing are in the code folder. - Save the dataset to a JSON file using the
xgi.write_json()method.
- Navigate to the XGI page on Zenodo.
- Click the "New Upload" button. This should prompt you to log into Zenodo and will bring up the form to upload a new dataset.
- Enter the information in the "new upload" form:
- In the "Files" section, drag and drop the file or click the "Upload files" button.
- When asked "Do you already have a DOI for this upload?", select "No".
- Under "Resource type", select "Dataset" from the dropdown list.
- Under "Title" enter the dataset name selected above.
- Under "Creators", add yourself with name, ORCID, and affiliation along with your role (typically "Data collector" or "Data curator")
- Under "Description" write the name of the dataset, where it is from, how it was collected, what nodes and edges are, and some basic statistics about the dataset.
- Under "Version", type "v0.0" if this is the first version of the dataset.
- Click the "Submit for review" button. This will send it to the XGI-DATA moderators for review.
Once the dataset has been added to Zenodo, do the following:
- Fork XGI-DATA.
- Move the import script created prior to the
codefolder. - Add an entry (in alphabetical order) in
index.jsonwith:- The dataset name as the key (all lowercase!)
- The value as a dictionary
{"url": <url>} - The url can be found by going to the XGI page on Zenodo, and clicking on the record you just made. Then the url is
https://zenodo.org/records/<number>/<dataset name>.json.
- In
README.md, add the dataset name (alphabetically) as a hyperlink with the Zenodo page url. - Run
get_stats.jsonwith the dataset name as an argument inload_xgi_data(). If every cell in this notebook is run, it will update theindex.jsonfile and add a plot of the degree/edge size distribution.