zxhezexin commited on
Commit
a1fcb55
1 Parent(s): 5635e24

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +74 -0
README.md ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ datasets:
4
+ - allenai/objaverse
5
+ pipeline_tag: image-to-3d
6
+ ---
7
+
8
+ # Model Card for OpenLRM V1.1
9
+
10
+ ## Overview
11
+
12
+ - This model card is for the [OpenLRM](https://github.com/3DTopia/OpenLRM) project, which is an open-source implementation of the paper [LRM](https://arxiv.org/abs/2311.04400).
13
+ - Information contained in this model card corresponds to [Version 1.1](https://github.com/3DTopia/OpenLRM/releases).
14
+
15
+ ## Model Details
16
+
17
+ - Training data
18
+
19
+ | Model | Training Data |
20
+ | :---: | :---: |
21
+ | [openlrm-obj-small-1.1](https://huggingface.co/zxhezexin/openlrm-obj-small-1.1) | Objaverse |
22
+ | [openlrm-obj-base-1.1](https://huggingface.co/zxhezexin/openlrm-obj-base-1.1) | Objaverse |
23
+ | [openlrm-obj-large-1.1](https://huggingface.co/zxhezexin/openlrm-obj-large-1.1) | Objaverse |
24
+ | [openlrm-mix-small-1.1](https://huggingface.co/zxhezexin/openlrm-mix-small-1.1) | Objaverse + MVImgNet |
25
+ | [openlrm-mix-base-1.1](https://huggingface.co/zxhezexin/openlrm-mix-base-1.1) | Objaverse + MVImgNet |
26
+ | [openlrm-mix-large-1.1](https://huggingface.co/zxhezexin/openlrm-mix-large-1.1) | Objaverse + MVImgNet |
27
+
28
+ - Model architecture (version==1.1)
29
+
30
+ | Type | Layers | Feat. Dim | Attn. Heads | Triplane Dim. | Input Res. | Image Encoder | Size |
31
+ | :---: | :----: | :-------: | :---------: | :-----------: | :--------: | :---------------: | :---: |
32
+ | small | 12 | 512 | 8 | 32 | 224 | dinov2_vits14_reg | 446M |
33
+ | base | 12 | 768 | 12 | 48 | 336 | dinov2_vitb14_reg | 1.04G |
34
+ | large | 16 | 1024 | 16 | 80 | 448 | dinov2_vitb14_reg | 1.81G |
35
+
36
+ - Training settings
37
+
38
+ | Type | Rend. Res. | Rend. Patch | Ray Samples |
39
+ | :---: | :--------: | :---------: | :---------: |
40
+ | small | 192 | 64 | 96 |
41
+ | base | 288 | 96 | 96 |
42
+ | large | 384 | 128 | 128 |
43
+
44
+ ## Notable Differences from the Original Paper
45
+
46
+ - We do not use the deferred back-propagation technique in the original paper.
47
+ - We used random background colors during training.
48
+ - The image encoder is based on the [DINOv2](https://github.com/facebookresearch/dinov2) model with register tokens.
49
+ - The triplane decoder contains 4 layers in our implementation.
50
+
51
+ ## License
52
+
53
+ - The model weights are released under the [Creative Commons Attribution-NonCommercial 4.0 International License](LICENSE_WEIGHT).
54
+ - They are provided for research purposes only, and CANNOT be used commercially.
55
+
56
+ ## Disclaimer
57
+
58
+ This model is an open-source implementation and is NOT the official release of the original research paper. While it aims to reproduce the original results as faithfully as possible, there may be variations due to model implementation, training data, and other factors.
59
+
60
+ ### Ethical Considerations
61
+
62
+ - This model should be used responsibly and ethically, and should not be used for malicious purposes.
63
+ - Users should be aware of potential biases in the training data.
64
+ - The model should not be used under the circumstances that could lead to harm or unfair treatment of individuals or groups.
65
+
66
+ ### Usage Considerations
67
+
68
+ - The model is provided "as is" without warranty of any kind.
69
+ - Users are responsible for ensuring that their use complies with all relevant laws and regulations.
70
+ - The developers and contributors of this model are not liable for any damages or losses arising from the use of this model.
71
+
72
+ ---
73
+
74
+ *This model card is subject to updates and modifications. Users are advised to check for the latest version regularly.*