Update README.md
Browse files
README.md
CHANGED
@@ -8,7 +8,7 @@ The model consists of 12 layers with a model dimension of 768, and a feedforward
|
|
8 |
# Training data
|
9 |
GPT-J 162B was pre-trained on the Pile, a large-scale curated dataset created by EleutherAI. It was then finetuned on synthetically generated data that was procedurally generated using the Rhinocers/Grasshopper software suite. The model was finetuned for 1.25 billion tokens over 11,500 steps on TPU v3-8. It was trained as an autoregressive language model, using cross-entropy loss to maximize the likelihood of predicting the next token correctly.
|
10 |
|
11 |
-
#Intended Use and Limitations
|
12 |
Architext models learn an inner representation of the architectural design that can be used to generate a larger diversity of geometric designs and can be useful for many downstream design workflows and tasks. While it could be adapted to many different design outputs, the model is best at generating residential floor plans given a natural language prompt.
|
13 |
|
14 |
# How to use
|
@@ -45,5 +45,5 @@ To cite the codebase that trained this model:
|
|
45 |
month = May
|
46 |
}
|
47 |
|
48 |
-
#Acknowledgements
|
49 |
This project would not have been possible without compute generously provided by Google through the TPU Research Cloud that generously provided access to Clout TPU VMs used to finetune this model.
|
|
|
8 |
# Training data
|
9 |
GPT-J 162B was pre-trained on the Pile, a large-scale curated dataset created by EleutherAI. It was then finetuned on synthetically generated data that was procedurally generated using the Rhinocers/Grasshopper software suite. The model was finetuned for 1.25 billion tokens over 11,500 steps on TPU v3-8. It was trained as an autoregressive language model, using cross-entropy loss to maximize the likelihood of predicting the next token correctly.
|
10 |
|
11 |
+
# Intended Use and Limitations
|
12 |
Architext models learn an inner representation of the architectural design that can be used to generate a larger diversity of geometric designs and can be useful for many downstream design workflows and tasks. While it could be adapted to many different design outputs, the model is best at generating residential floor plans given a natural language prompt.
|
13 |
|
14 |
# How to use
|
|
|
45 |
month = May
|
46 |
}
|
47 |
|
48 |
+
# Acknowledgements
|
49 |
This project would not have been possible without compute generously provided by Google through the TPU Research Cloud that generously provided access to Clout TPU VMs used to finetune this model.
|