自良 commited on
Commit
f29dd73
·
1 Parent(s): 86ec9d5

update leaderboard

Browse files
arena_elo/cut_off_date.txt CHANGED
@@ -1 +1 @@
1
- 20250116
 
1
+ 20250201
arena_elo/results/20250201/clean_battle.json ADDED
@@ -0,0 +1,1442 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "model_a": "GPT-4o + FLUX.1 [dev]",
4
+ "model_b": "ChatDiT",
5
+ "winner": "model_b",
6
+ "judge": "arena_user_127.0.0.1",
7
+ "anony": true,
8
+ "tstamp": 1735030427.6669
9
+ },
10
+ {
11
+ "model_a": "GPT-4o + FLUX.1 [dev]",
12
+ "model_b": "ChatDiT",
13
+ "winner": "model_a",
14
+ "judge": "arena_user_127.0.0.1",
15
+ "anony": true,
16
+ "tstamp": 1735030452.0238
17
+ },
18
+ {
19
+ "model_a": "ChatDiT",
20
+ "model_b": "GPT-4o + FLUX.1 [dev]",
21
+ "winner": "model_a",
22
+ "judge": "arena_user_127.0.0.1",
23
+ "anony": true,
24
+ "tstamp": 1735030464.2602
25
+ },
26
+ {
27
+ "model_a": "ChatDiT",
28
+ "model_b": "GPT-4o + FLUX.1 [dev]",
29
+ "winner": "model_a",
30
+ "judge": "arena_user_127.0.0.1",
31
+ "anony": true,
32
+ "tstamp": 1735030476.2328
33
+ },
34
+ {
35
+ "model_a": "GPT-4o + FLUX.1 [dev]",
36
+ "model_b": "ChatDiT",
37
+ "winner": "tie (bothbad)",
38
+ "judge": "arena_user_127.0.0.1",
39
+ "anony": true,
40
+ "tstamp": 1735030495.2955
41
+ },
42
+ {
43
+ "model_a": "ChatDiT",
44
+ "model_b": "GPT-4o + FLUX.1 [dev]",
45
+ "winner": "tie (bothbad)",
46
+ "judge": "arena_user_127.0.0.1",
47
+ "anony": true,
48
+ "tstamp": 1735030503.418
49
+ },
50
+ {
51
+ "model_a": "ChatDiT",
52
+ "model_b": "GPT-4o + FLUX.1 [dev]",
53
+ "winner": "model_a",
54
+ "judge": "arena_user_127.0.0.1",
55
+ "anony": true,
56
+ "tstamp": 1735030511.3926
57
+ },
58
+ {
59
+ "model_a": "ChatDiT",
60
+ "model_b": "GPT-4o + FLUX.1 [dev]",
61
+ "winner": "tie (bothbad)",
62
+ "judge": "arena_user_127.0.0.1",
63
+ "anony": true,
64
+ "tstamp": 1735034259.9984
65
+ },
66
+ {
67
+ "model_a": "ChatDiT",
68
+ "model_b": "GPT-4o + FLUX.1 [dev]",
69
+ "winner": "model_a",
70
+ "judge": "arena_user_127.0.0.1",
71
+ "anony": true,
72
+ "tstamp": 1735034275.6871
73
+ },
74
+ {
75
+ "model_a": "ChatDiT",
76
+ "model_b": "GPT-4o + FLUX.1 [dev]",
77
+ "winner": "model_a",
78
+ "judge": "arena_user_127.0.0.1",
79
+ "anony": true,
80
+ "tstamp": 1735034284.7354
81
+ },
82
+ {
83
+ "model_a": "GPT-4o + FLUX.1 [dev]",
84
+ "model_b": "ChatDiT",
85
+ "winner": "model_a",
86
+ "judge": "arena_user_127.0.0.1",
87
+ "anony": true,
88
+ "tstamp": 1735034293.468
89
+ },
90
+ {
91
+ "model_a": "ChatDiT",
92
+ "model_b": "GPT-4o + FLUX.1 [dev]",
93
+ "winner": "model_b",
94
+ "judge": "arena_user_127.0.0.1",
95
+ "anony": true,
96
+ "tstamp": 1735034303.2042
97
+ },
98
+ {
99
+ "model_a": "ChatDiT",
100
+ "model_b": "GPT-4o + FLUX.1 [dev]",
101
+ "winner": "model_a",
102
+ "judge": "arena_user_127.0.0.1",
103
+ "anony": true,
104
+ "tstamp": 1735034314.1941
105
+ },
106
+ {
107
+ "model_a": "GPT-4o + FLUX.1 [dev]",
108
+ "model_b": "ChatDiT",
109
+ "winner": "model_a",
110
+ "judge": "arena_user_127.0.0.1",
111
+ "anony": true,
112
+ "tstamp": 1735034326.5092
113
+ },
114
+ {
115
+ "model_a": "GPT-4o + FLUX.1 [dev]",
116
+ "model_b": "ChatDiT",
117
+ "winner": "model_b",
118
+ "judge": "arena_user_127.0.0.1",
119
+ "anony": true,
120
+ "tstamp": 1735034331.6963
121
+ },
122
+ {
123
+ "model_a": "GPT-4o + FLUX.1 [dev]",
124
+ "model_b": "ChatDiT",
125
+ "winner": "tie (bothbad)",
126
+ "judge": "arena_user_127.0.0.1",
127
+ "anony": true,
128
+ "tstamp": 1735034336.5346
129
+ },
130
+ {
131
+ "model_a": "ChatDiT",
132
+ "model_b": "GPT-4o + FLUX.1 [dev]",
133
+ "winner": "model_b",
134
+ "judge": "arena_user_127.0.0.1",
135
+ "anony": true,
136
+ "tstamp": 1735034351.9521
137
+ },
138
+ {
139
+ "model_a": "GPT-4o + FLUX.1 [dev]",
140
+ "model_b": "ChatDiT",
141
+ "winner": "model_b",
142
+ "judge": "arena_user_127.0.0.1",
143
+ "anony": true,
144
+ "tstamp": 1735034366.1775
145
+ },
146
+ {
147
+ "model_a": "GPT-4o + FLUX.1 [dev]",
148
+ "model_b": "ChatDiT",
149
+ "winner": "model_a",
150
+ "judge": "arena_user_127.0.0.1",
151
+ "anony": true,
152
+ "tstamp": 1735034380.5877
153
+ },
154
+ {
155
+ "model_a": "ChatDiT",
156
+ "model_b": "GPT-4o + FLUX.1 [dev]",
157
+ "winner": "model_b",
158
+ "judge": "arena_user_127.0.0.1",
159
+ "anony": true,
160
+ "tstamp": 1735034384.3087
161
+ },
162
+ {
163
+ "model_a": "GPT-4o + FLUX.1 [dev]",
164
+ "model_b": "ChatDiT",
165
+ "winner": "model_a",
166
+ "judge": "arena_user_127.0.0.1",
167
+ "anony": true,
168
+ "tstamp": 1735034389.1583
169
+ },
170
+ {
171
+ "model_a": "GPT-4o + FLUX.1 [dev]",
172
+ "model_b": "ChatDiT",
173
+ "winner": "model_b",
174
+ "judge": "arena_user_127.0.0.1",
175
+ "anony": true,
176
+ "tstamp": 1735034405.9359
177
+ },
178
+ {
179
+ "model_a": "GPT-4o + FLUX.1 [dev]",
180
+ "model_b": "ChatDiT",
181
+ "winner": "model_b",
182
+ "judge": "arena_user_127.0.0.1",
183
+ "anony": true,
184
+ "tstamp": 1735034412.3533
185
+ },
186
+ {
187
+ "model_a": "GPT-4o + FLUX.1 [dev]",
188
+ "model_b": "ChatDiT",
189
+ "winner": "model_a",
190
+ "judge": "arena_user_127.0.0.1",
191
+ "anony": true,
192
+ "tstamp": 1735034419.0118
193
+ },
194
+ {
195
+ "model_a": "GPT-4o + FLUX.1 [dev]",
196
+ "model_b": "ChatDiT",
197
+ "winner": "model_b",
198
+ "judge": "arena_user_127.0.0.1",
199
+ "anony": true,
200
+ "tstamp": 1735034425.6972
201
+ },
202
+ {
203
+ "model_a": "GPT-4o + FLUX.1 [dev]",
204
+ "model_b": "ChatDiT",
205
+ "winner": "model_b",
206
+ "judge": "arena_user_127.0.0.1",
207
+ "anony": true,
208
+ "tstamp": 1735034432.5891
209
+ },
210
+ {
211
+ "model_a": "ChatDiT",
212
+ "model_b": "GPT-4o + FLUX.1 [dev]",
213
+ "winner": "model_a",
214
+ "judge": "arena_user_127.0.0.1",
215
+ "anony": true,
216
+ "tstamp": 1735092762.0
217
+ },
218
+ {
219
+ "model_a": "GPT-4o + FLUX.1 [dev]",
220
+ "model_b": "ChatDiT",
221
+ "winner": "tie (bothbad)",
222
+ "judge": "arena_user_127.0.0.1",
223
+ "anony": true,
224
+ "tstamp": 1735092774.618
225
+ },
226
+ {
227
+ "model_a": "GPT-4o + FLUX.1 [dev]",
228
+ "model_b": "ChatDiT",
229
+ "winner": "model_a",
230
+ "judge": "arena_user_127.0.0.1",
231
+ "anony": true,
232
+ "tstamp": 1735092797.2067
233
+ },
234
+ {
235
+ "model_a": "GPT-4o + FLUX.1 [dev]",
236
+ "model_b": "ChatDiT",
237
+ "winner": "model_b",
238
+ "judge": "arena_user_127.0.0.1",
239
+ "anony": true,
240
+ "tstamp": 1735092804.6699
241
+ },
242
+ {
243
+ "model_a": "GPT-4o + FLUX.1 [dev]",
244
+ "model_b": "ChatDiT",
245
+ "winner": "model_a",
246
+ "judge": "arena_user_127.0.0.1",
247
+ "anony": true,
248
+ "tstamp": 1735092810.2635
249
+ },
250
+ {
251
+ "model_a": "GPT-4o + FLUX.1 [dev]",
252
+ "model_b": "ChatDiT",
253
+ "winner": "model_b",
254
+ "judge": "arena_user_127.0.0.1",
255
+ "anony": true,
256
+ "tstamp": 1735093113.5724
257
+ },
258
+ {
259
+ "model_a": "ChatDiT",
260
+ "model_b": "GPT-4o + FLUX.1 [dev]",
261
+ "winner": "tie (bothbad)",
262
+ "judge": "arena_user_127.0.0.1",
263
+ "anony": true,
264
+ "tstamp": 1735093133.2436
265
+ },
266
+ {
267
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
268
+ "model_b": "GPT-4o + OmniGen",
269
+ "winner": "model_a",
270
+ "judge": "arena_user_127.0.0.1",
271
+ "anony": true,
272
+ "tstamp": 1735187628.4881
273
+ },
274
+ {
275
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
276
+ "model_b": "GPT-4o + PixArt-Sigma",
277
+ "winner": "model_b",
278
+ "judge": "arena_user_127.0.0.1",
279
+ "anony": true,
280
+ "tstamp": 1735187649.4872
281
+ },
282
+ {
283
+ "model_a": "GPT-4o + Emu2",
284
+ "model_b": "ChatDiT",
285
+ "winner": "model_a",
286
+ "judge": "arena_user_127.0.0.1",
287
+ "anony": true,
288
+ "tstamp": 1735197562.2637
289
+ },
290
+ {
291
+ "model_a": "GPT-4o + FLUX.1 [dev]",
292
+ "model_b": "GPT-4o + PixArt-Sigma",
293
+ "winner": "model_a",
294
+ "judge": "arena_user_127.0.0.1",
295
+ "anony": true,
296
+ "tstamp": 1735197586.8438
297
+ },
298
+ {
299
+ "model_a": "ChatDiT",
300
+ "model_b": "GPT-4o + FLUX.1 [dev]",
301
+ "winner": "model_a",
302
+ "judge": "arena_user_127.0.0.1",
303
+ "anony": false,
304
+ "tstamp": 1735201758.7145
305
+ },
306
+ {
307
+ "model_a": "GPT-4o + DALLE-3",
308
+ "model_b": "GPT-4o + PixArt-Sigma",
309
+ "winner": "model_b",
310
+ "judge": "arena_user_127.0.0.1",
311
+ "anony": false,
312
+ "tstamp": 1735202083.631
313
+ },
314
+ {
315
+ "model_a": "GPT-4o + DALLE-3",
316
+ "model_b": "GPT-4o + PixArt-Sigma",
317
+ "winner": "model_a",
318
+ "judge": "arena_user_127.0.0.1",
319
+ "anony": false,
320
+ "tstamp": 1735202099.4377
321
+ },
322
+ {
323
+ "model_a": "GPT-4o + OmniGen",
324
+ "model_b": "ChatDiT",
325
+ "winner": "model_b",
326
+ "judge": "arena_user_127.0.0.1",
327
+ "anony": true,
328
+ "tstamp": 1735202132.8592
329
+ },
330
+ {
331
+ "model_a": "GPT-4o + DALLE-3",
332
+ "model_b": "GPT-4o + PixArt-Sigma",
333
+ "winner": "model_b",
334
+ "judge": "arena_user_127.0.0.1",
335
+ "anony": false,
336
+ "tstamp": 1735202545.8694
337
+ },
338
+ {
339
+ "model_a": "GPT-4o + DALLE-3",
340
+ "model_b": "GPT-4o + PixArt-Sigma",
341
+ "winner": "model_a",
342
+ "judge": "arena_user_127.0.0.1",
343
+ "anony": false,
344
+ "tstamp": 1735202565.5723
345
+ },
346
+ {
347
+ "model_a": "GPT-4o + DALLE-3",
348
+ "model_b": "GPT-4o + PixArt-Sigma",
349
+ "winner": "tie (bothbad)",
350
+ "judge": "arena_user_127.0.0.1",
351
+ "anony": false,
352
+ "tstamp": 1735202573.0118
353
+ },
354
+ {
355
+ "model_a": "GPT-4o + DALLE-3",
356
+ "model_b": "GPT-4o + PixArt-Sigma",
357
+ "winner": "tie (bothbad)",
358
+ "judge": "arena_user_127.0.0.1",
359
+ "anony": false,
360
+ "tstamp": 1735203523.809
361
+ },
362
+ {
363
+ "model_a": "GPT-4o + OmniGen",
364
+ "model_b": "GPT-4o + DALLE-3",
365
+ "winner": "model_b",
366
+ "judge": "arena_user_127.0.0.1",
367
+ "anony": true,
368
+ "tstamp": 1735205600.7414
369
+ },
370
+ {
371
+ "model_a": "ChatDiT",
372
+ "model_b": "GPT-4o + DALLE-3",
373
+ "winner": "model_a",
374
+ "judge": "arena_user_127.0.0.1",
375
+ "anony": true,
376
+ "tstamp": 1735207454.8251
377
+ },
378
+ {
379
+ "model_a": "GPT-4o + OmniGen",
380
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
381
+ "winner": "model_b",
382
+ "judge": "arena_user_127.0.0.1",
383
+ "anony": true,
384
+ "tstamp": 1735207466.0131
385
+ },
386
+ {
387
+ "model_a": "GPT-4o + DALLE-3",
388
+ "model_b": "GPT-4o + Emu2",
389
+ "winner": "model_b",
390
+ "judge": "arena_user_127.0.0.1",
391
+ "anony": true,
392
+ "tstamp": 1735215923.1589
393
+ },
394
+ {
395
+ "model_a": "GPT-4o + PixArt-Sigma",
396
+ "model_b": "GPT-4o + DALLE-3",
397
+ "winner": "model_a",
398
+ "judge": "arena_user_127.0.0.1",
399
+ "anony": true,
400
+ "tstamp": 1735215935.7597
401
+ },
402
+ {
403
+ "model_a": "GPT-4o + OmniGen",
404
+ "model_b": "GPT-4o + PixArt-Sigma",
405
+ "winner": "tie (bothbad)",
406
+ "judge": "arena_user_127.0.0.1",
407
+ "anony": true,
408
+ "tstamp": 1735215942.7093
409
+ },
410
+ {
411
+ "model_a": "GPT-4o + PixArt-Sigma",
412
+ "model_b": "GPT-4o + OmniGen",
413
+ "winner": "model_a",
414
+ "judge": "arena_user_127.0.0.1",
415
+ "anony": true,
416
+ "tstamp": 1735215949.7965
417
+ },
418
+ {
419
+ "model_a": "GPT-4o + DALLE-3",
420
+ "model_b": "ChatDiT",
421
+ "winner": "model_b",
422
+ "judge": "arena_user_127.0.0.1",
423
+ "anony": true,
424
+ "tstamp": 1735215962.6898
425
+ },
426
+ {
427
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
428
+ "model_b": "GPT-4o + DALLE-3",
429
+ "winner": "tie (bothbad)",
430
+ "judge": "arena_user_127.0.0.1",
431
+ "anony": true,
432
+ "tstamp": 1735215968.9052
433
+ },
434
+ {
435
+ "model_a": "GPT-4o + FLUX.1 [dev]",
436
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
437
+ "winner": "tie (bothbad)",
438
+ "judge": "arena_user_127.0.0.1",
439
+ "anony": true,
440
+ "tstamp": 1735215976.5079
441
+ },
442
+ {
443
+ "model_a": "GPT-4o + Emu2",
444
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
445
+ "winner": "model_b",
446
+ "judge": "arena_user_127.0.0.1",
447
+ "anony": true,
448
+ "tstamp": 1735215982.9709
449
+ },
450
+ {
451
+ "model_a": "ChatDiT",
452
+ "model_b": "GPT-4o + PixArt-Sigma",
453
+ "winner": "model_a",
454
+ "judge": "arena_user_127.0.0.1",
455
+ "anony": true,
456
+ "tstamp": 1735215993.2305
457
+ },
458
+ {
459
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
460
+ "model_b": "GPT-4o + FLUX.1 [dev]",
461
+ "winner": "tie (bothbad)",
462
+ "judge": "arena_user_127.0.0.1",
463
+ "anony": true,
464
+ "tstamp": 1735215999.8713
465
+ },
466
+ {
467
+ "model_a": "GPT-4o + PixArt-Sigma",
468
+ "model_b": "GPT-4o + FLUX.1 [dev]",
469
+ "winner": "model_b",
470
+ "judge": "arena_user_127.0.0.1",
471
+ "anony": true,
472
+ "tstamp": 1735216012.8216
473
+ },
474
+ {
475
+ "model_a": "ChatDiT",
476
+ "model_b": "GPT-4o + PixArt-Sigma",
477
+ "winner": "model_a",
478
+ "judge": "arena_user_127.0.0.1",
479
+ "anony": true,
480
+ "tstamp": 1735216021.653
481
+ },
482
+ {
483
+ "model_a": "GPT-4o + PixArt-Sigma",
484
+ "model_b": "GPT-4o + OmniGen",
485
+ "winner": "model_b",
486
+ "judge": "arena_user_127.0.0.1",
487
+ "anony": true,
488
+ "tstamp": 1735286354.5764
489
+ },
490
+ {
491
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
492
+ "model_b": "ChatDiT",
493
+ "winner": "tie (bothbad)",
494
+ "judge": "arena_user_127.0.0.1",
495
+ "anony": true,
496
+ "tstamp": 1735286365.2329
497
+ },
498
+ {
499
+ "model_a": "GPT-4o + Emu2",
500
+ "model_b": "ChatDiT",
501
+ "winner": "model_a",
502
+ "judge": "arena_user_127.0.0.1",
503
+ "anony": true,
504
+ "tstamp": 1735286374.6751
505
+ },
506
+ {
507
+ "model_a": "GPT-4o + FLUX.1 [dev]",
508
+ "model_b": "GPT-4o + Emu2",
509
+ "winner": "model_a",
510
+ "judge": "arena_user_127.0.0.1",
511
+ "anony": true,
512
+ "tstamp": 1735286382.1211
513
+ },
514
+ {
515
+ "model_a": "GPT-4o + PixArt-Sigma",
516
+ "model_b": "GPT-4o + OmniGen",
517
+ "winner": "model_a",
518
+ "judge": "arena_user_127.0.0.1",
519
+ "anony": true,
520
+ "tstamp": 1735288723.7052
521
+ },
522
+ {
523
+ "model_a": "GPT-4o + FLUX.1 [dev]",
524
+ "model_b": "GPT-4o + DALLE-3",
525
+ "winner": "model_a",
526
+ "judge": "arena_user_127.0.0.1",
527
+ "anony": true,
528
+ "tstamp": 1735288729.3576
529
+ },
530
+ {
531
+ "model_a": "GPT-4o + PixArt-Sigma",
532
+ "model_b": "GPT-4o + OmniGen",
533
+ "winner": "model_a",
534
+ "judge": "arena_user_127.0.0.1",
535
+ "anony": true,
536
+ "tstamp": 1735288749.1708
537
+ },
538
+ {
539
+ "model_a": "GPT-4o + FLUX.1 [dev]",
540
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
541
+ "winner": "model_a",
542
+ "judge": "arena_user_127.0.0.1",
543
+ "anony": true,
544
+ "tstamp": 1736305459.7554
545
+ },
546
+ {
547
+ "model_a": "GPT-4o + Emu2",
548
+ "model_b": "GPT-4o + DALLE-3",
549
+ "winner": "model_b",
550
+ "judge": "arena_user_127.0.0.1",
551
+ "anony": true,
552
+ "tstamp": 1736305568.3703
553
+ },
554
+ {
555
+ "model_a": "GPT-4o + Emu2",
556
+ "model_b": "GPT-4o + PixArt-Sigma",
557
+ "winner": "model_b",
558
+ "judge": "arena_user_127.0.0.1",
559
+ "anony": true,
560
+ "tstamp": 1736305578.3648
561
+ },
562
+ {
563
+ "model_a": "GPT-4o + OmniGen",
564
+ "model_b": "GPT-4o + PixArt-Sigma",
565
+ "winner": "model_b",
566
+ "judge": "arena_user_10.16.39.228",
567
+ "anony": true,
568
+ "tstamp": 1736316463.4086
569
+ },
570
+ {
571
+ "model_a": "GPT-4o + OmniGen",
572
+ "model_b": "GPT-4o + PixArt-Sigma",
573
+ "winner": "model_a",
574
+ "judge": "arena_user_10.16.30.109",
575
+ "anony": true,
576
+ "tstamp": 1736316525.3474
577
+ },
578
+ {
579
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
580
+ "model_b": "GPT-4o + DALLE-3",
581
+ "winner": "tie (bothbad)",
582
+ "judge": "arena_user_10.16.9.166",
583
+ "anony": false,
584
+ "tstamp": 1736317079.2219
585
+ },
586
+ {
587
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
588
+ "model_b": "GPT-4o + OmniGen",
589
+ "winner": "model_a",
590
+ "judge": "arena_user_10.16.39.228",
591
+ "anony": true,
592
+ "tstamp": 1736317103.5229
593
+ },
594
+ {
595
+ "model_a": "ChatDiT",
596
+ "model_b": "GPT-4o + PixArt-Sigma",
597
+ "winner": "model_b",
598
+ "judge": "arena_user_10.16.9.166",
599
+ "anony": true,
600
+ "tstamp": 1736317151.2313
601
+ },
602
+ {
603
+ "model_a": "GPT-4o + Emu2",
604
+ "model_b": "GPT-4o + PixArt-Sigma",
605
+ "winner": "model_b",
606
+ "judge": "arena_user_10.16.24.150",
607
+ "anony": true,
608
+ "tstamp": 1736317260.068
609
+ },
610
+ {
611
+ "model_a": "GPT-4o + PixArt-Sigma",
612
+ "model_b": "GPT-4o + OmniGen",
613
+ "winner": "model_b",
614
+ "judge": "arena_user_172.18.13.178",
615
+ "anony": true,
616
+ "tstamp": 1736320695.0812
617
+ },
618
+ {
619
+ "model_a": "ChatDiT",
620
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
621
+ "winner": "model_a",
622
+ "judge": "arena_user_10.16.43.67",
623
+ "anony": true,
624
+ "tstamp": 1736321735.4094
625
+ },
626
+ {
627
+ "model_a": "GPT-4o + PixArt-Sigma",
628
+ "model_b": "GPT-4o + FLUX.1 [dev]",
629
+ "winner": "model_a",
630
+ "judge": "arena_user_10.16.24.150",
631
+ "anony": true,
632
+ "tstamp": 1736335598.6764
633
+ },
634
+ {
635
+ "model_a": "GPT-4o + OmniGen",
636
+ "model_b": "GPT-4o + PixArt-Sigma",
637
+ "winner": "model_a",
638
+ "judge": "arena_user_10.16.43.67",
639
+ "anony": true,
640
+ "tstamp": 1736422216.5691
641
+ },
642
+ {
643
+ "model_a": "GPT-4o + DALLE-3",
644
+ "model_b": "GPT-4o + OmniGen",
645
+ "winner": "model_a",
646
+ "judge": "arena_user_10.16.4.183",
647
+ "anony": true,
648
+ "tstamp": 1736422234.0031
649
+ },
650
+ {
651
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
652
+ "model_b": "GPT-4o + FLUX.1 [dev]",
653
+ "winner": "model_a",
654
+ "judge": "arena_user_10.16.4.183",
655
+ "anony": true,
656
+ "tstamp": 1736422247.0046
657
+ },
658
+ {
659
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
660
+ "model_b": "GPT-4o + PixArt-Sigma",
661
+ "winner": "model_b",
662
+ "judge": "arena_user_10.16.30.109",
663
+ "anony": true,
664
+ "tstamp": 1736459822.3244
665
+ },
666
+ {
667
+ "model_a": "ChatDiT",
668
+ "model_b": "GPT-4o + FLUX.1 [dev]",
669
+ "winner": "model_b",
670
+ "judge": "arena_user_10.16.5.242",
671
+ "anony": false,
672
+ "tstamp": 1736459862.4607
673
+ },
674
+ {
675
+ "model_a": "ChatDiT",
676
+ "model_b": "GPT-4o + FLUX.1 [dev]",
677
+ "winner": "model_a",
678
+ "judge": "arena_user_10.16.5.242",
679
+ "anony": false,
680
+ "tstamp": 1736459879.8897
681
+ },
682
+ {
683
+ "model_a": "ChatDiT",
684
+ "model_b": "GPT-4o + FLUX.1 [dev]",
685
+ "winner": "model_b",
686
+ "judge": "arena_user_10.16.43.67",
687
+ "anony": false,
688
+ "tstamp": 1736459886.1956
689
+ },
690
+ {
691
+ "model_a": "ChatDiT",
692
+ "model_b": "GPT-4o + FLUX.1 [dev]",
693
+ "winner": "model_b",
694
+ "judge": "arena_user_10.16.30.109",
695
+ "anony": false,
696
+ "tstamp": 1736459892.1553
697
+ },
698
+ {
699
+ "model_a": "ChatDiT",
700
+ "model_b": "GPT-4o + FLUX.1 [dev]",
701
+ "winner": "model_a",
702
+ "judge": "arena_user_10.16.5.242",
703
+ "anony": false,
704
+ "tstamp": 1736459905.8186
705
+ },
706
+ {
707
+ "model_a": "GPT-4o + PixArt-Sigma",
708
+ "model_b": "GPT-4o + DALLE-3",
709
+ "winner": "model_a",
710
+ "judge": "arena_user_10.16.5.242",
711
+ "anony": true,
712
+ "tstamp": 1736461355.0274
713
+ },
714
+ {
715
+ "model_a": "GPT-4o + FLUX.1 [dev]",
716
+ "model_b": "GPT-4o + DALLE-3",
717
+ "winner": "model_a",
718
+ "judge": "arena_user_10.16.5.242",
719
+ "anony": true,
720
+ "tstamp": 1736461393.0574
721
+ },
722
+ {
723
+ "model_a": "GPT-4o + Emu2",
724
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
725
+ "winner": "model_b",
726
+ "judge": "arena_user_10.16.5.242",
727
+ "anony": true,
728
+ "tstamp": 1736461404.4455
729
+ },
730
+ {
731
+ "model_a": "ChatDiT",
732
+ "model_b": "GPT-4o + FLUX.1 [dev]",
733
+ "winner": "model_a",
734
+ "judge": "arena_user_10.16.30.109",
735
+ "anony": false,
736
+ "tstamp": 1736461424.6169
737
+ },
738
+ {
739
+ "model_a": "ChatDiT",
740
+ "model_b": "GPT-4o + FLUX.1 [dev]",
741
+ "winner": "model_a",
742
+ "judge": "arena_user_10.16.5.242",
743
+ "anony": false,
744
+ "tstamp": 1736461434.3463
745
+ },
746
+ {
747
+ "model_a": "ChatDiT",
748
+ "model_b": "GPT-4o + FLUX.1 [dev]",
749
+ "winner": "model_a",
750
+ "judge": "arena_user_10.16.30.109",
751
+ "anony": false,
752
+ "tstamp": 1736461445.38
753
+ },
754
+ {
755
+ "model_a": "ChatDiT",
756
+ "model_b": "GPT-4o + FLUX.1 [dev]",
757
+ "winner": "model_b",
758
+ "judge": "arena_user_10.16.30.109",
759
+ "anony": false,
760
+ "tstamp": 1736461454.3017
761
+ },
762
+ {
763
+ "model_a": "GPT-4o + PixArt-Sigma",
764
+ "model_b": "GPT-4o + DALLE-3",
765
+ "winner": "model_b",
766
+ "judge": "arena_user_10.16.30.109",
767
+ "anony": true,
768
+ "tstamp": 1736499082.7837
769
+ },
770
+ {
771
+ "model_a": "ChatDiT",
772
+ "model_b": "ChatDiT",
773
+ "winner": "tie (bothbad)",
774
+ "judge": "arena_user_10.20.26.107",
775
+ "anony": false,
776
+ "tstamp": 1736510204.8628
777
+ },
778
+ {
779
+ "model_a": "GPT-4o + DALLE-3",
780
+ "model_b": "GPT-4o + OmniGen",
781
+ "winner": "model_a",
782
+ "judge": "arena_user_10.20.26.107",
783
+ "anony": false,
784
+ "tstamp": 1736510267.1272
785
+ },
786
+ {
787
+ "model_a": "GPT-4o + DALLE-3",
788
+ "model_b": "GPT-4o + OmniGen",
789
+ "winner": "model_a",
790
+ "judge": "arena_user_10.20.34.20",
791
+ "anony": false,
792
+ "tstamp": 1736510278.5264
793
+ },
794
+ {
795
+ "model_a": "GPT-4o + DALLE-3",
796
+ "model_b": "GPT-4o + OmniGen",
797
+ "winner": "model_a",
798
+ "judge": "arena_user_10.20.34.20",
799
+ "anony": false,
800
+ "tstamp": 1736510292.1874
801
+ },
802
+ {
803
+ "model_a": "GPT-4o + FLUX.1 [dev]",
804
+ "model_b": "ChatDiT",
805
+ "winner": "model_a",
806
+ "judge": "arena_user_172.18.13.116",
807
+ "anony": true,
808
+ "tstamp": 1736519867.6924
809
+ },
810
+ {
811
+ "model_a": "GPT-4o + PixArt-Sigma",
812
+ "model_b": "GPT-4o + FLUX.1 [dev]",
813
+ "winner": "model_a",
814
+ "judge": "arena_user_172.18.13.199",
815
+ "anony": true,
816
+ "tstamp": 1736519902.5557
817
+ },
818
+ {
819
+ "model_a": "GPT-4o + FLUX.1 [dev]",
820
+ "model_b": "GPT-4o + OmniGen",
821
+ "winner": "model_a",
822
+ "judge": "arena_user_172.18.10.33",
823
+ "anony": true,
824
+ "tstamp": 1736520001.5821
825
+ },
826
+ {
827
+ "model_a": "GPT-4o + FLUX.1 [dev]",
828
+ "model_b": "GPT-4o + Emu2",
829
+ "winner": "model_a",
830
+ "judge": "arena_user_10.16.5.242",
831
+ "anony": true,
832
+ "tstamp": 1736747098.5538
833
+ },
834
+ {
835
+ "model_a": "GPT-4o + DALLE-3",
836
+ "model_b": "GPT-4o + Emu2",
837
+ "winner": "tie (bothbad)",
838
+ "judge": "arena_user_10.16.5.242",
839
+ "anony": true,
840
+ "tstamp": 1736747127.3685
841
+ },
842
+ {
843
+ "model_a": "GPT-4o + DALLE-3",
844
+ "model_b": "GPT-4o + FLUX.1 [dev]",
845
+ "winner": "model_a",
846
+ "judge": "arena_user_10.16.39.228",
847
+ "anony": true,
848
+ "tstamp": 1736747185.1173
849
+ },
850
+ {
851
+ "model_a": "GPT-4o + Emu2",
852
+ "model_b": "GPT-4o + PixArt-Sigma",
853
+ "winner": "model_b",
854
+ "judge": "arena_user_10.16.39.228",
855
+ "anony": true,
856
+ "tstamp": 1736747224.0831
857
+ },
858
+ {
859
+ "model_a": "GPT-4o + DALLE-3",
860
+ "model_b": "GPT-4o + FLUX.1 [dev]",
861
+ "winner": "tie (bothbad)",
862
+ "judge": "arena_user_10.16.5.242",
863
+ "anony": true,
864
+ "tstamp": 1736747246.0112
865
+ },
866
+ {
867
+ "model_a": "GPT-4o + PixArt-Sigma",
868
+ "model_b": "GPT-4o + FLUX.1 [dev]",
869
+ "winner": "model_b",
870
+ "judge": "arena_user_10.16.19.14",
871
+ "anony": true,
872
+ "tstamp": 1736747351.9581
873
+ },
874
+ {
875
+ "model_a": "GPT-4o + DALLE-3",
876
+ "model_b": "GPT-4o + OmniGen",
877
+ "winner": "model_a",
878
+ "judge": "arena_user_10.16.39.228",
879
+ "anony": true,
880
+ "tstamp": 1736848254.0065
881
+ },
882
+ {
883
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
884
+ "model_b": "ChatDiT",
885
+ "winner": "model_a",
886
+ "judge": "arena_user_10.16.5.242",
887
+ "anony": true,
888
+ "tstamp": 1736848277.7204
889
+ },
890
+ {
891
+ "model_a": "GPT-4o + DALLE-3",
892
+ "model_b": "GPT-4o + Emu2",
893
+ "winner": "model_b",
894
+ "judge": "arena_user_10.16.5.242",
895
+ "anony": true,
896
+ "tstamp": 1736848340.7214
897
+ },
898
+ {
899
+ "model_a": "GPT-4o + OmniGen",
900
+ "model_b": "GPT-4o + Emu2",
901
+ "winner": "tie (bothbad)",
902
+ "judge": "arena_user_10.16.19.14",
903
+ "anony": true,
904
+ "tstamp": 1736848375.4713
905
+ },
906
+ {
907
+ "model_a": "GPT-4o + FLUX.1 [dev]",
908
+ "model_b": "GPT-4o + OmniGen",
909
+ "winner": "tie (bothbad)",
910
+ "judge": "arena_user_10.16.37.94",
911
+ "anony": true,
912
+ "tstamp": 1736848396.7144
913
+ },
914
+ {
915
+ "model_a": "ChatDiT",
916
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
917
+ "winner": "model_b",
918
+ "judge": "arena_user_10.16.19.14",
919
+ "anony": true,
920
+ "tstamp": 1736848420.9218
921
+ },
922
+ {
923
+ "model_a": "GPT-4o + Emu2",
924
+ "model_b": "GPT-4o + FLUX.1 [dev]",
925
+ "winner": "model_b",
926
+ "judge": "arena_user_10.16.5.242",
927
+ "anony": true,
928
+ "tstamp": 1736848435.2839
929
+ },
930
+ {
931
+ "model_a": "ChatDiT",
932
+ "model_b": "GPT-4o + FLUX.1 [dev]",
933
+ "winner": "model_b",
934
+ "judge": "arena_user_10.16.39.228",
935
+ "anony": true,
936
+ "tstamp": 1736848454.9223
937
+ },
938
+ {
939
+ "model_a": "GPT-4o + DALLE-3",
940
+ "model_b": "ChatDiT",
941
+ "winner": "tie (bothbad)",
942
+ "judge": "arena_user_10.16.37.94",
943
+ "anony": true,
944
+ "tstamp": 1736903972.8796
945
+ },
946
+ {
947
+ "model_a": "GPT-4o + OmniGen",
948
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
949
+ "winner": "model_b",
950
+ "judge": "arena_user_10.16.5.242",
951
+ "anony": true,
952
+ "tstamp": 1736904095.5894
953
+ },
954
+ {
955
+ "model_a": "GPT-4o + DALLE-3",
956
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
957
+ "winner": "model_b",
958
+ "judge": "arena_user_10.16.37.94",
959
+ "anony": true,
960
+ "tstamp": 1736904105.9232
961
+ },
962
+ {
963
+ "model_a": "GPT-4o + FLUX.1 [dev]",
964
+ "model_b": "GPT-4o + DALLE-3",
965
+ "winner": "tie (bothbad)",
966
+ "judge": "arena_user_10.16.39.228",
967
+ "anony": true,
968
+ "tstamp": 1736904243.1258
969
+ },
970
+ {
971
+ "model_a": "GPT-4o + DALLE-3",
972
+ "model_b": "GPT-4o + PixArt-Sigma",
973
+ "winner": "model_a",
974
+ "judge": "arena_user_10.16.37.94",
975
+ "anony": true,
976
+ "tstamp": 1736904288.108
977
+ },
978
+ {
979
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
980
+ "model_b": "ChatDiT",
981
+ "winner": "model_a",
982
+ "judge": "arena_user_10.16.5.183",
983
+ "anony": true,
984
+ "tstamp": 1736904394.6512
985
+ },
986
+ {
987
+ "model_a": "GPT-4o + PixArt-Sigma",
988
+ "model_b": "GPT-4o + Emu2",
989
+ "winner": "model_b",
990
+ "judge": "arena_user_10.16.5.183",
991
+ "anony": true,
992
+ "tstamp": 1736904497.6642
993
+ },
994
+ {
995
+ "model_a": "GPT-4o + PixArt-Sigma",
996
+ "model_b": "GPT-4o + Emu2",
997
+ "winner": "tie (bothbad)",
998
+ "judge": "arena_user_10.16.19.14",
999
+ "anony": true,
1000
+ "tstamp": 1736904527.5648
1001
+ },
1002
+ {
1003
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1004
+ "model_b": "GPT-4o + OmniGen",
1005
+ "winner": "tie (bothbad)",
1006
+ "judge": "arena_user_10.16.5.242",
1007
+ "anony": true,
1008
+ "tstamp": 1736904578.7368
1009
+ },
1010
+ {
1011
+ "model_a": "GPT-4o + DALLE-3",
1012
+ "model_b": "ChatDiT",
1013
+ "winner": "model_a",
1014
+ "judge": "arena_user_10.16.37.94",
1015
+ "anony": true,
1016
+ "tstamp": 1736904638.5873
1017
+ },
1018
+ {
1019
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1020
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1021
+ "winner": "tie (bothbad)",
1022
+ "judge": "arena_user_10.16.39.228",
1023
+ "anony": true,
1024
+ "tstamp": 1736904690.3944
1025
+ },
1026
+ {
1027
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1028
+ "model_b": "GPT-4o + OmniGen",
1029
+ "winner": "tie (bothbad)",
1030
+ "judge": "arena_user_10.16.39.228",
1031
+ "anony": true,
1032
+ "tstamp": 1736904719.2626
1033
+ },
1034
+ {
1035
+ "model_a": "GPT-4o + OmniGen",
1036
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1037
+ "winner": "model_b",
1038
+ "judge": "arena_user_10.16.19.14",
1039
+ "anony": true,
1040
+ "tstamp": 1736904866.9338
1041
+ },
1042
+ {
1043
+ "model_a": "ChatDiT",
1044
+ "model_b": "GPT-4o + PixArt-Sigma",
1045
+ "winner": "model_b",
1046
+ "judge": "arena_user_10.16.5.242",
1047
+ "anony": true,
1048
+ "tstamp": 1736904988.1352
1049
+ },
1050
+ {
1051
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1052
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1053
+ "winner": "model_a",
1054
+ "judge": "arena_user_10.16.39.228",
1055
+ "anony": true,
1056
+ "tstamp": 1736905085.4366
1057
+ },
1058
+ {
1059
+ "model_a": "GPT-4o + Emu2",
1060
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1061
+ "winner": "tie (bothbad)",
1062
+ "judge": "arena_user_10.16.37.94",
1063
+ "anony": true,
1064
+ "tstamp": 1736905112.817
1065
+ },
1066
+ {
1067
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1068
+ "model_b": "ChatDiT",
1069
+ "winner": "tie (bothbad)",
1070
+ "judge": "arena_user_10.16.5.242",
1071
+ "anony": true,
1072
+ "tstamp": 1736905145.5873
1073
+ },
1074
+ {
1075
+ "model_a": "GPT-4o + Emu2",
1076
+ "model_b": "GPT-4o + PixArt-Sigma",
1077
+ "winner": "model_b",
1078
+ "judge": "arena_user_10.16.39.228",
1079
+ "anony": true,
1080
+ "tstamp": 1736905182.7192
1081
+ },
1082
+ {
1083
+ "model_a": "GPT-4o + Emu2",
1084
+ "model_b": "ChatDiT",
1085
+ "winner": "model_b",
1086
+ "judge": "arena_user_10.16.5.242",
1087
+ "anony": true,
1088
+ "tstamp": 1736905206.448
1089
+ },
1090
+ {
1091
+ "model_a": "GPT-4o + DALLE-3",
1092
+ "model_b": "GPT-4o + PixArt-Sigma",
1093
+ "winner": "tie (bothbad)",
1094
+ "judge": "arena_user_10.16.39.228",
1095
+ "anony": true,
1096
+ "tstamp": 1736905228.476
1097
+ },
1098
+ {
1099
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1100
+ "model_b": "GPT-4o + Emu2",
1101
+ "winner": "model_a",
1102
+ "judge": "arena_user_10.16.5.242",
1103
+ "anony": true,
1104
+ "tstamp": 1736905283.1204
1105
+ },
1106
+ {
1107
+ "model_a": "ChatDiT",
1108
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1109
+ "winner": "tie (bothbad)",
1110
+ "judge": "arena_user_10.16.37.94",
1111
+ "anony": false,
1112
+ "tstamp": 1737049708.7299
1113
+ },
1114
+ {
1115
+ "model_a": "ChatDiT",
1116
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1117
+ "winner": "model_a",
1118
+ "judge": "arena_user_10.16.37.94",
1119
+ "anony": false,
1120
+ "tstamp": 1737049726.3072
1121
+ },
1122
+ {
1123
+ "model_a": "GPT-4o + PixArt-Sigma",
1124
+ "model_b": "GPT-4o + DALLE-3",
1125
+ "winner": "model_a",
1126
+ "judge": "arena_user_10.16.39.228",
1127
+ "anony": true,
1128
+ "tstamp": 1737260184.9546
1129
+ },
1130
+ {
1131
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1132
+ "model_b": "ChatDiT",
1133
+ "winner": "tie (bothbad)",
1134
+ "judge": "arena_user_10.16.37.94",
1135
+ "anony": true,
1136
+ "tstamp": 1737342007.3311
1137
+ },
1138
+ {
1139
+ "model_a": "GPT-4o + PixArt-Sigma",
1140
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1141
+ "winner": "model_a",
1142
+ "judge": "arena_user_10.16.19.14",
1143
+ "anony": true,
1144
+ "tstamp": 1737342038.3455
1145
+ },
1146
+ {
1147
+ "model_a": "GPT-4o + Emu2",
1148
+ "model_b": "GPT-4o + PixArt-Sigma",
1149
+ "winner": "model_b",
1150
+ "judge": "arena_user_10.16.37.94",
1151
+ "anony": true,
1152
+ "tstamp": 1737342063.6411
1153
+ },
1154
+ {
1155
+ "model_a": "GPT-4o + Emu2",
1156
+ "model_b": "GPT-4o + PixArt-Sigma",
1157
+ "winner": "model_b",
1158
+ "judge": "arena_user_10.16.39.228",
1159
+ "anony": true,
1160
+ "tstamp": 1737342081.2581
1161
+ },
1162
+ {
1163
+ "model_a": "GPT-4o + DALLE-3",
1164
+ "model_b": "GPT-4o + Emu2",
1165
+ "winner": "tie (bothbad)",
1166
+ "judge": "arena_user_10.16.5.242",
1167
+ "anony": true,
1168
+ "tstamp": 1737342094.1056
1169
+ },
1170
+ {
1171
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1172
+ "model_b": "ChatDiT",
1173
+ "winner": "model_a",
1174
+ "judge": "arena_user_10.16.37.94",
1175
+ "anony": true,
1176
+ "tstamp": 1737342133.6158
1177
+ },
1178
+ {
1179
+ "model_a": "GPT-4o + Emu2",
1180
+ "model_b": "ChatDiT",
1181
+ "winner": "tie (bothbad)",
1182
+ "judge": "arena_user_10.16.5.242",
1183
+ "anony": true,
1184
+ "tstamp": 1737342168.5823
1185
+ },
1186
+ {
1187
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1188
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1189
+ "winner": "model_a",
1190
+ "judge": "arena_user_10.16.5.242",
1191
+ "anony": true,
1192
+ "tstamp": 1737345352.0281
1193
+ },
1194
+ {
1195
+ "model_a": "ChatDiT",
1196
+ "model_b": "GPT-4o + PixArt-Sigma",
1197
+ "winner": "tie (bothbad)",
1198
+ "judge": "arena_user_10.16.37.94",
1199
+ "anony": true,
1200
+ "tstamp": 1737345374.5042
1201
+ },
1202
+ {
1203
+ "model_a": "ChatDiT",
1204
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1205
+ "winner": "tie (bothbad)",
1206
+ "judge": "arena_user_10.16.5.183",
1207
+ "anony": false,
1208
+ "tstamp": 1737345480.6137
1209
+ },
1210
+ {
1211
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1212
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1213
+ "winner": "tie (bothbad)",
1214
+ "judge": "arena_user_10.16.19.14",
1215
+ "anony": true,
1216
+ "tstamp": 1737416898.513
1217
+ },
1218
+ {
1219
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1220
+ "model_b": "GPT-4o + OmniGen",
1221
+ "winner": "model_a",
1222
+ "judge": "arena_user_10.16.19.14",
1223
+ "anony": true,
1224
+ "tstamp": 1737416913.3366
1225
+ },
1226
+ {
1227
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1228
+ "model_b": "GPT-4o + Emu2",
1229
+ "winner": "model_a",
1230
+ "judge": "arena_user_10.16.39.228",
1231
+ "anony": true,
1232
+ "tstamp": 1737416931.2714
1233
+ },
1234
+ {
1235
+ "model_a": "ChatDiT",
1236
+ "model_b": "GPT-4o + OmniGen",
1237
+ "winner": "model_b",
1238
+ "judge": "arena_user_10.16.39.228",
1239
+ "anony": true,
1240
+ "tstamp": 1737456983.6035
1241
+ },
1242
+ {
1243
+ "model_a": "ChatDiT",
1244
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1245
+ "winner": "model_b",
1246
+ "judge": "arena_user_10.16.5.183",
1247
+ "anony": false,
1248
+ "tstamp": 1737617763.3504
1249
+ },
1250
+ {
1251
+ "model_a": "GPT-4o + OmniGen",
1252
+ "model_b": "GPT-4o + DALLE-3",
1253
+ "winner": "model_b",
1254
+ "judge": "arena_user_10.16.46.168",
1255
+ "anony": true,
1256
+ "tstamp": 1737873713.8355
1257
+ },
1258
+ {
1259
+ "model_a": "ChatDiT",
1260
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1261
+ "winner": "model_b",
1262
+ "judge": "arena_user_10.20.26.107",
1263
+ "anony": false,
1264
+ "tstamp": 1737993471.7271
1265
+ },
1266
+ {
1267
+ "model_a": "ChatDiT",
1268
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1269
+ "winner": "model_b",
1270
+ "judge": "arena_user_10.20.34.20",
1271
+ "anony": false,
1272
+ "tstamp": 1737993492.6951
1273
+ },
1274
+ {
1275
+ "model_a": "ChatDiT",
1276
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1277
+ "winner": "model_b",
1278
+ "judge": "arena_user_10.20.26.107",
1279
+ "anony": false,
1280
+ "tstamp": 1737993507.3486
1281
+ },
1282
+ {
1283
+ "model_a": "ChatDiT",
1284
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1285
+ "winner": "model_b",
1286
+ "judge": "arena_user_10.20.34.20",
1287
+ "anony": false,
1288
+ "tstamp": 1737993526.1914
1289
+ },
1290
+ {
1291
+ "model_a": "ChatDiT",
1292
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1293
+ "winner": "model_a",
1294
+ "judge": "arena_user_10.20.34.20",
1295
+ "anony": false,
1296
+ "tstamp": 1737993546.952
1297
+ },
1298
+ {
1299
+ "model_a": "ChatDiT",
1300
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1301
+ "winner": "tie (bothbad)",
1302
+ "judge": "arena_user_10.20.34.20",
1303
+ "anony": false,
1304
+ "tstamp": 1737993570.2157
1305
+ },
1306
+ {
1307
+ "model_a": "ChatDiT",
1308
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1309
+ "winner": "model_b",
1310
+ "judge": "arena_user_10.20.34.20",
1311
+ "anony": false,
1312
+ "tstamp": 1737993609.9858
1313
+ },
1314
+ {
1315
+ "model_a": "GPT-4o + DALLE-3",
1316
+ "model_b": "ChatDiT",
1317
+ "winner": "model_a",
1318
+ "judge": "arena_user_10.16.21.179",
1319
+ "anony": true,
1320
+ "tstamp": 1737995248.9082
1321
+ },
1322
+ {
1323
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1324
+ "model_b": "GPT-4o + DALLE-3",
1325
+ "winner": "model_a",
1326
+ "judge": "arena_user_10.16.17.134",
1327
+ "anony": false,
1328
+ "tstamp": 1738006573.8034
1329
+ },
1330
+ {
1331
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1332
+ "model_b": "GPT-4o + DALLE-3",
1333
+ "winner": "model_a",
1334
+ "judge": "arena_user_10.16.6.226",
1335
+ "anony": false,
1336
+ "tstamp": 1738006608.8794
1337
+ },
1338
+ {
1339
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1340
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1341
+ "winner": "tie (bothbad)",
1342
+ "judge": "arena_user_10.16.17.80",
1343
+ "anony": false,
1344
+ "tstamp": 1738240456.0548
1345
+ },
1346
+ {
1347
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1348
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1349
+ "winner": "model_a",
1350
+ "judge": "arena_user_10.20.34.20",
1351
+ "anony": false,
1352
+ "tstamp": 1738414444.0896
1353
+ },
1354
+ {
1355
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1356
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1357
+ "winner": "model_a",
1358
+ "judge": "arena_user_10.20.34.20",
1359
+ "anony": false,
1360
+ "tstamp": 1738414457.7698
1361
+ },
1362
+ {
1363
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1364
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1365
+ "winner": "model_b",
1366
+ "judge": "arena_user_10.20.34.20",
1367
+ "anony": false,
1368
+ "tstamp": 1738414478.2776
1369
+ },
1370
+ {
1371
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1372
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1373
+ "winner": "model_b",
1374
+ "judge": "arena_user_10.20.34.20",
1375
+ "anony": false,
1376
+ "tstamp": 1738414496.1494
1377
+ },
1378
+ {
1379
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1380
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1381
+ "winner": "model_b",
1382
+ "judge": "arena_user_10.20.26.107",
1383
+ "anony": false,
1384
+ "tstamp": 1738414511.3636
1385
+ },
1386
+ {
1387
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1388
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1389
+ "winner": "model_b",
1390
+ "judge": "arena_user_10.16.24.179",
1391
+ "anony": true,
1392
+ "tstamp": 1738430226.7418
1393
+ },
1394
+ {
1395
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1396
+ "model_b": "GPT-4o + PixArt-Sigma",
1397
+ "winner": "tie (bothbad)",
1398
+ "judge": "arena_user_10.16.24.179",
1399
+ "anony": true,
1400
+ "tstamp": 1738478228.5285
1401
+ },
1402
+ {
1403
+ "model_a": "GPT-4o + PixArt-Sigma",
1404
+ "model_b": "GPT-4o + OmniGen",
1405
+ "winner": "model_b",
1406
+ "judge": "arena_user_10.16.39.39",
1407
+ "anony": true,
1408
+ "tstamp": 1738478268.4485
1409
+ },
1410
+ {
1411
+ "model_a": "GPT-4o + Emu2",
1412
+ "model_b": "GPT-4o + PixArt-Sigma",
1413
+ "winner": "tie (bothbad)",
1414
+ "judge": "arena_user_10.16.6.226",
1415
+ "anony": true,
1416
+ "tstamp": 1738478334.8795
1417
+ },
1418
+ {
1419
+ "model_a": "GPT-4o + Emu2",
1420
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1421
+ "winner": "model_a",
1422
+ "judge": "arena_user_10.16.24.179",
1423
+ "anony": true,
1424
+ "tstamp": 1738478358.2449
1425
+ },
1426
+ {
1427
+ "model_a": "GPT-4o + OmniGen",
1428
+ "model_b": "ChatDiT",
1429
+ "winner": "tie (bothbad)",
1430
+ "judge": "arena_user_10.16.39.39",
1431
+ "anony": true,
1432
+ "tstamp": 1738478400.0329
1433
+ },
1434
+ {
1435
+ "model_a": "GPT-4o + OmniGen",
1436
+ "model_b": "GPT-4o + Emu2",
1437
+ "winner": "model_a",
1438
+ "judge": "arena_user_10.16.24.179",
1439
+ "anony": true,
1440
+ "tstamp": 1738478519.1341
1441
+ }
1442
+ ]
arena_elo/results/20250201/elo_results.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:794f7356f69a11cff00fa8652bdc67f9e53ae992c6906d30660ae74aae759dbf
3
+ size 57566
arena_elo/results/20250201/leaderboard.csv ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ key,Model,Arena Elo rating (anony),Arena Elo rating (full),license,creator,link
2
+ GPT-4o + FLUX.1 [dev],GPT-4o + FLUX.1 [dev],1064.608732752121,1051.9755239924489,FLUX.1 [dev] Non-Commercial License,Black Forest Labs,https://huggingface.co/black-forest-labs/FLUX.1-dev
3
+ GPT-4o + Stable Diffusion 3 Medium,GPT-4o + Stable Diffusion 3 Medium,1044.6114958719775,1015.2398523332247,Stability AI Community License,Stability AI,https://huggingface.co/stabilityai/stable-diffusion-3-medium
4
+ GPT-4o + PixArt-Sigma,GPT-4o + PixArt-Sigma,1038.1297388902892,1008.3174801571165,CreativeML Open RAIL++-M License,Huawei Noah's Ark Lab,https://huggingface.co/PixArt-alpha/PixArt-Sigma-XL-2-1024-MS
5
+ ChatDiT,ChatDiT,1031.4458586252017,1014.6002187228936,MIT License,Tongyi Lab,https://github.com/ali-vilab/ChatDiT
6
+ GPT-4o + DALLE-3,GPT-4o + DALLE-3,1000.112282270205,985.901694033994,OpenAI Terms of Use,OpenAI,https://openai.com/index/dall-e-3/
7
+ GPT-4o + OmniGen,GPT-4o + OmniGen,917.2308100572435,867.520902591999,MIT License,BAAI,https://huggingface.co/spaces/Shitao/OmniGen
8
+ GPT-4o + Emu2,GPT-4o + Emu2,903.861081532962,880.7769108192662,Apache License 2.0,BAAI,https://huggingface.co/BAAI/Emu2
arena_elo/results/latest/clean_battle.json CHANGED
@@ -1118,5 +1118,325 @@
1118
  "judge": "arena_user_10.16.37.94",
1119
  "anony": false,
1120
  "tstamp": 1737049726.3072
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1121
  }
1122
  ]
 
1118
  "judge": "arena_user_10.16.37.94",
1119
  "anony": false,
1120
  "tstamp": 1737049726.3072
1121
+ },
1122
+ {
1123
+ "model_a": "GPT-4o + PixArt-Sigma",
1124
+ "model_b": "GPT-4o + DALLE-3",
1125
+ "winner": "model_a",
1126
+ "judge": "arena_user_10.16.39.228",
1127
+ "anony": true,
1128
+ "tstamp": 1737260184.9546
1129
+ },
1130
+ {
1131
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1132
+ "model_b": "ChatDiT",
1133
+ "winner": "tie (bothbad)",
1134
+ "judge": "arena_user_10.16.37.94",
1135
+ "anony": true,
1136
+ "tstamp": 1737342007.3311
1137
+ },
1138
+ {
1139
+ "model_a": "GPT-4o + PixArt-Sigma",
1140
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1141
+ "winner": "model_a",
1142
+ "judge": "arena_user_10.16.19.14",
1143
+ "anony": true,
1144
+ "tstamp": 1737342038.3455
1145
+ },
1146
+ {
1147
+ "model_a": "GPT-4o + Emu2",
1148
+ "model_b": "GPT-4o + PixArt-Sigma",
1149
+ "winner": "model_b",
1150
+ "judge": "arena_user_10.16.37.94",
1151
+ "anony": true,
1152
+ "tstamp": 1737342063.6411
1153
+ },
1154
+ {
1155
+ "model_a": "GPT-4o + Emu2",
1156
+ "model_b": "GPT-4o + PixArt-Sigma",
1157
+ "winner": "model_b",
1158
+ "judge": "arena_user_10.16.39.228",
1159
+ "anony": true,
1160
+ "tstamp": 1737342081.2581
1161
+ },
1162
+ {
1163
+ "model_a": "GPT-4o + DALLE-3",
1164
+ "model_b": "GPT-4o + Emu2",
1165
+ "winner": "tie (bothbad)",
1166
+ "judge": "arena_user_10.16.5.242",
1167
+ "anony": true,
1168
+ "tstamp": 1737342094.1056
1169
+ },
1170
+ {
1171
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1172
+ "model_b": "ChatDiT",
1173
+ "winner": "model_a",
1174
+ "judge": "arena_user_10.16.37.94",
1175
+ "anony": true,
1176
+ "tstamp": 1737342133.6158
1177
+ },
1178
+ {
1179
+ "model_a": "GPT-4o + Emu2",
1180
+ "model_b": "ChatDiT",
1181
+ "winner": "tie (bothbad)",
1182
+ "judge": "arena_user_10.16.5.242",
1183
+ "anony": true,
1184
+ "tstamp": 1737342168.5823
1185
+ },
1186
+ {
1187
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1188
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1189
+ "winner": "model_a",
1190
+ "judge": "arena_user_10.16.5.242",
1191
+ "anony": true,
1192
+ "tstamp": 1737345352.0281
1193
+ },
1194
+ {
1195
+ "model_a": "ChatDiT",
1196
+ "model_b": "GPT-4o + PixArt-Sigma",
1197
+ "winner": "tie (bothbad)",
1198
+ "judge": "arena_user_10.16.37.94",
1199
+ "anony": true,
1200
+ "tstamp": 1737345374.5042
1201
+ },
1202
+ {
1203
+ "model_a": "ChatDiT",
1204
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1205
+ "winner": "tie (bothbad)",
1206
+ "judge": "arena_user_10.16.5.183",
1207
+ "anony": false,
1208
+ "tstamp": 1737345480.6137
1209
+ },
1210
+ {
1211
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1212
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1213
+ "winner": "tie (bothbad)",
1214
+ "judge": "arena_user_10.16.19.14",
1215
+ "anony": true,
1216
+ "tstamp": 1737416898.513
1217
+ },
1218
+ {
1219
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1220
+ "model_b": "GPT-4o + OmniGen",
1221
+ "winner": "model_a",
1222
+ "judge": "arena_user_10.16.19.14",
1223
+ "anony": true,
1224
+ "tstamp": 1737416913.3366
1225
+ },
1226
+ {
1227
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1228
+ "model_b": "GPT-4o + Emu2",
1229
+ "winner": "model_a",
1230
+ "judge": "arena_user_10.16.39.228",
1231
+ "anony": true,
1232
+ "tstamp": 1737416931.2714
1233
+ },
1234
+ {
1235
+ "model_a": "ChatDiT",
1236
+ "model_b": "GPT-4o + OmniGen",
1237
+ "winner": "model_b",
1238
+ "judge": "arena_user_10.16.39.228",
1239
+ "anony": true,
1240
+ "tstamp": 1737456983.6035
1241
+ },
1242
+ {
1243
+ "model_a": "ChatDiT",
1244
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1245
+ "winner": "model_b",
1246
+ "judge": "arena_user_10.16.5.183",
1247
+ "anony": false,
1248
+ "tstamp": 1737617763.3504
1249
+ },
1250
+ {
1251
+ "model_a": "GPT-4o + OmniGen",
1252
+ "model_b": "GPT-4o + DALLE-3",
1253
+ "winner": "model_b",
1254
+ "judge": "arena_user_10.16.46.168",
1255
+ "anony": true,
1256
+ "tstamp": 1737873713.8355
1257
+ },
1258
+ {
1259
+ "model_a": "ChatDiT",
1260
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1261
+ "winner": "model_b",
1262
+ "judge": "arena_user_10.20.26.107",
1263
+ "anony": false,
1264
+ "tstamp": 1737993471.7271
1265
+ },
1266
+ {
1267
+ "model_a": "ChatDiT",
1268
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1269
+ "winner": "model_b",
1270
+ "judge": "arena_user_10.20.34.20",
1271
+ "anony": false,
1272
+ "tstamp": 1737993492.6951
1273
+ },
1274
+ {
1275
+ "model_a": "ChatDiT",
1276
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1277
+ "winner": "model_b",
1278
+ "judge": "arena_user_10.20.26.107",
1279
+ "anony": false,
1280
+ "tstamp": 1737993507.3486
1281
+ },
1282
+ {
1283
+ "model_a": "ChatDiT",
1284
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1285
+ "winner": "model_b",
1286
+ "judge": "arena_user_10.20.34.20",
1287
+ "anony": false,
1288
+ "tstamp": 1737993526.1914
1289
+ },
1290
+ {
1291
+ "model_a": "ChatDiT",
1292
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1293
+ "winner": "model_a",
1294
+ "judge": "arena_user_10.20.34.20",
1295
+ "anony": false,
1296
+ "tstamp": 1737993546.952
1297
+ },
1298
+ {
1299
+ "model_a": "ChatDiT",
1300
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1301
+ "winner": "tie (bothbad)",
1302
+ "judge": "arena_user_10.20.34.20",
1303
+ "anony": false,
1304
+ "tstamp": 1737993570.2157
1305
+ },
1306
+ {
1307
+ "model_a": "ChatDiT",
1308
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1309
+ "winner": "model_b",
1310
+ "judge": "arena_user_10.20.34.20",
1311
+ "anony": false,
1312
+ "tstamp": 1737993609.9858
1313
+ },
1314
+ {
1315
+ "model_a": "GPT-4o + DALLE-3",
1316
+ "model_b": "ChatDiT",
1317
+ "winner": "model_a",
1318
+ "judge": "arena_user_10.16.21.179",
1319
+ "anony": true,
1320
+ "tstamp": 1737995248.9082
1321
+ },
1322
+ {
1323
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1324
+ "model_b": "GPT-4o + DALLE-3",
1325
+ "winner": "model_a",
1326
+ "judge": "arena_user_10.16.17.134",
1327
+ "anony": false,
1328
+ "tstamp": 1738006573.8034
1329
+ },
1330
+ {
1331
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1332
+ "model_b": "GPT-4o + DALLE-3",
1333
+ "winner": "model_a",
1334
+ "judge": "arena_user_10.16.6.226",
1335
+ "anony": false,
1336
+ "tstamp": 1738006608.8794
1337
+ },
1338
+ {
1339
+ "model_a": "GPT-4o + FLUX.1 [dev]",
1340
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1341
+ "winner": "tie (bothbad)",
1342
+ "judge": "arena_user_10.16.17.80",
1343
+ "anony": false,
1344
+ "tstamp": 1738240456.0548
1345
+ },
1346
+ {
1347
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1348
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1349
+ "winner": "model_a",
1350
+ "judge": "arena_user_10.20.34.20",
1351
+ "anony": false,
1352
+ "tstamp": 1738414444.0896
1353
+ },
1354
+ {
1355
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1356
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1357
+ "winner": "model_a",
1358
+ "judge": "arena_user_10.20.34.20",
1359
+ "anony": false,
1360
+ "tstamp": 1738414457.7698
1361
+ },
1362
+ {
1363
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1364
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1365
+ "winner": "model_b",
1366
+ "judge": "arena_user_10.20.34.20",
1367
+ "anony": false,
1368
+ "tstamp": 1738414478.2776
1369
+ },
1370
+ {
1371
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1372
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1373
+ "winner": "model_b",
1374
+ "judge": "arena_user_10.20.34.20",
1375
+ "anony": false,
1376
+ "tstamp": 1738414496.1494
1377
+ },
1378
+ {
1379
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1380
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1381
+ "winner": "model_b",
1382
+ "judge": "arena_user_10.20.26.107",
1383
+ "anony": false,
1384
+ "tstamp": 1738414511.3636
1385
+ },
1386
+ {
1387
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1388
+ "model_b": "GPT-4o + FLUX.1 [dev]",
1389
+ "winner": "model_b",
1390
+ "judge": "arena_user_10.16.24.179",
1391
+ "anony": true,
1392
+ "tstamp": 1738430226.7418
1393
+ },
1394
+ {
1395
+ "model_a": "GPT-4o + Stable Diffusion 3 Medium",
1396
+ "model_b": "GPT-4o + PixArt-Sigma",
1397
+ "winner": "tie (bothbad)",
1398
+ "judge": "arena_user_10.16.24.179",
1399
+ "anony": true,
1400
+ "tstamp": 1738478228.5285
1401
+ },
1402
+ {
1403
+ "model_a": "GPT-4o + PixArt-Sigma",
1404
+ "model_b": "GPT-4o + OmniGen",
1405
+ "winner": "model_b",
1406
+ "judge": "arena_user_10.16.39.39",
1407
+ "anony": true,
1408
+ "tstamp": 1738478268.4485
1409
+ },
1410
+ {
1411
+ "model_a": "GPT-4o + Emu2",
1412
+ "model_b": "GPT-4o + PixArt-Sigma",
1413
+ "winner": "tie (bothbad)",
1414
+ "judge": "arena_user_10.16.6.226",
1415
+ "anony": true,
1416
+ "tstamp": 1738478334.8795
1417
+ },
1418
+ {
1419
+ "model_a": "GPT-4o + Emu2",
1420
+ "model_b": "GPT-4o + Stable Diffusion 3 Medium",
1421
+ "winner": "model_a",
1422
+ "judge": "arena_user_10.16.24.179",
1423
+ "anony": true,
1424
+ "tstamp": 1738478358.2449
1425
+ },
1426
+ {
1427
+ "model_a": "GPT-4o + OmniGen",
1428
+ "model_b": "ChatDiT",
1429
+ "winner": "tie (bothbad)",
1430
+ "judge": "arena_user_10.16.39.39",
1431
+ "anony": true,
1432
+ "tstamp": 1738478400.0329
1433
+ },
1434
+ {
1435
+ "model_a": "GPT-4o + OmniGen",
1436
+ "model_b": "GPT-4o + Emu2",
1437
+ "winner": "model_a",
1438
+ "judge": "arena_user_10.16.24.179",
1439
+ "anony": true,
1440
+ "tstamp": 1738478519.1341
1441
  }
1442
  ]
arena_elo/results/latest/elo_results.pkl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:63ee2afc25d210a56f47f2d40fdd7acc28b8ce959f409d4940c4c2d8b8dd9209
3
- size 57554
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:794f7356f69a11cff00fa8652bdc67f9e53ae992c6906d30660ae74aae759dbf
3
+ size 57566
arena_elo/results/latest/leaderboard.csv CHANGED
@@ -1,8 +1,8 @@
1
  key,Model,Arena Elo rating (anony),Arena Elo rating (full),license,creator,link
2
- GPT-4o + Stable Diffusion 3 Medium,GPT-4o + Stable Diffusion 3 Medium,1102.9463840916624,1079.7854718235376,Stability AI Community License,Stability AI,https://huggingface.co/stabilityai/stable-diffusion-3-medium
3
- GPT-4o + FLUX.1 [dev],GPT-4o + FLUX.1 [dev],1060.6518500297198,1034.6448672311114,FLUX.1 [dev] Non-Commercial License,Black Forest Labs,https://huggingface.co/black-forest-labs/FLUX.1-dev
4
- ChatDiT,ChatDiT,1056.8196120773982,1049.9056120780351,MIT License,Tongyi Lab,https://github.com/ali-vilab/ChatDiT
5
- GPT-4o + PixArt-Sigma,GPT-4o + PixArt-Sigma,1013.6232701542713,993.1059562440837,CreativeML Open RAIL++-M License,Huawei Noah's Ark Lab,https://huggingface.co/PixArt-alpha/PixArt-Sigma-XL-2-1024-MS
6
- GPT-4o + DALLE-3,GPT-4o + DALLE-3,987.3468199967783,994.4966917920444,OpenAI Terms of Use,OpenAI,https://openai.com/index/dall-e-3/
7
- GPT-4o + Emu2,GPT-4o + Emu2,907.3058128436727,891.4032279179117,Apache License 2.0,BAAI,https://huggingface.co/BAAI/Emu2
8
- GPT-4o + OmniGen,GPT-4o + OmniGen,871.3062508064975,823.347088339123,MIT License,BAAI,https://huggingface.co/spaces/Shitao/OmniGen
 
1
  key,Model,Arena Elo rating (anony),Arena Elo rating (full),license,creator,link
2
+ GPT-4o + FLUX.1 [dev],GPT-4o + FLUX.1 [dev],1064.608732752121,1051.9755239924489,FLUX.1 [dev] Non-Commercial License,Black Forest Labs,https://huggingface.co/black-forest-labs/FLUX.1-dev
3
+ GPT-4o + Stable Diffusion 3 Medium,GPT-4o + Stable Diffusion 3 Medium,1044.6114958719775,1015.2398523332247,Stability AI Community License,Stability AI,https://huggingface.co/stabilityai/stable-diffusion-3-medium
4
+ GPT-4o + PixArt-Sigma,GPT-4o + PixArt-Sigma,1038.1297388902892,1008.3174801571165,CreativeML Open RAIL++-M License,Huawei Noah's Ark Lab,https://huggingface.co/PixArt-alpha/PixArt-Sigma-XL-2-1024-MS
5
+ ChatDiT,ChatDiT,1031.4458586252017,1014.6002187228936,MIT License,Tongyi Lab,https://github.com/ali-vilab/ChatDiT
6
+ GPT-4o + DALLE-3,GPT-4o + DALLE-3,1000.112282270205,985.901694033994,OpenAI Terms of Use,OpenAI,https://openai.com/index/dall-e-3/
7
+ GPT-4o + OmniGen,GPT-4o + OmniGen,917.2308100572435,867.520902591999,MIT License,BAAI,https://huggingface.co/spaces/Shitao/OmniGen
8
+ GPT-4o + Emu2,GPT-4o + Emu2,903.861081532962,880.7769108192662,Apache License 2.0,BAAI,https://huggingface.co/BAAI/Emu2