File size: 19,226 Bytes
c176aea
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
# :orange[Abstract:] 
  Hall effect thrusters are one of the most versatile and
  popular electric propulsion systems for space use. Industry trends
  towards interplanetary missions arise advances in design development
  of such propulsion systems. It is understood that correct sizing of
  discharge channel in Hall effect thruster impact performance greatly.
  Since the complete physics model of such propulsion system is not yet
  optimized for fast computations and design iterations, most thrusters
  are being designed using so-called scaling laws. But this work focuses
  on rather novel approach, which is outlined less frequently than
  ordinary scaling design approach in literature. Using deep machine
  learning it is possible to create predictive performance model, which
  can be used to effortlessly get design of required hall thruster with
  required characteristics using way less computing power than design
  from scratch and way more flexible than usual scaling approach.
:orange[author:] Korolev K.V [^1]
title: Hall effect thruster design via deep neural network for additive
  manufacturing

# Nomenclature

<div class="longtable*" markdown="1">

$U_d$ = discharge voltage  
$P$ = discharge power  
$T$ = thrust  
$\dot{m}_a$ = mass flow rate  
$I_{sp}$ = specific impulse  
$\eta_m$ = mass utilization efficiency  
$\eta_a$ = anode efficiency  
$j$ = $P/v$ \[power density\]  
$v$ = discharge channel volume  
$h, d, L$ = generic geometry parameters  
$C_*$ = set of scaling coefficients  
$g$ = free-fall acceleration  
$M$ = ion mass

</div>

# Introduction

<span class="lettrine">T</span><span class="smallcaps">he</span>
application of deep learning is extremely diverse, but in this study it
focuses on case of hall effect thruster design. Hall effect thruster
(HET) is rather simple DC plasma acceleration device, due to complex and
non linear process physics we don’t have any full analytical performance
models yet. Though there are a lot of ways these systems are designed in
industry with great efficiencies, but in cost of multi-million research
budgets and time. This problem might be solved using neural network
design approach and few hardware iteration tweaks(Plyashkov et al.
2022-10-25).

Scaled thrusters tend to have good performance but this approach isn’t
that flexible for numerous reasons: first and foremost, due to large
deviations in all of the initial experimental values accuracy can be not
that good, secondly, it is hardly possible to design thruster with
different power density or $I_{sp}$ efficiently.

On the other hand, the neural network design approach has accuracy
advantage only on domain of the dataset(Plyashkov et al. 2022-10-25),
this limitations is easily compensated by ability to create relations
between multiple discharge and geometry parameters at once. Hence this
novel approach and scaling relations together could be an ultimate
endgame design tool for HET.

Note that neither of these models do not include cathode efficiencies
and performances. So as the neutral gas thrust components. Most
correlations in previous literature were made using assumption or
physics laws(Shagayda and Gorshkov 2013-03), in this paper the new
method based on feature generation, GAN dataset augmentation and ML
feature selection is suggested.

## Dataset enlargement using GAN

As we already have discussed, the data which is available is not enough
for training NN or most ML algorithms, so I suggest using Generative
Adversarial Network to generate more similar points. Generative model
trains two different models - generator and discriminator. Generator
learns how to generate new points which are classified by discriminator
as similar to real dataset. Of course it is very understandable that
model needs to be precise enough not to overfit on data or create new
unknown correlations. Model was checked via Mean Absolute Percentage
Error (MAPE) and physical boundary conditions. After assembling most
promising architecture, the model was able to generate fake points with
MAPE of $~4.7\%$. We need to measure MAPE to be sure point lie on same
domain as original dataset, as in this work we are interested in
sub-kilowatt thrusters. After model generated new points they were check
to fit in physical boundaries of scaled values (for example thrust
couldn’t be more than 2, efficiency more than 1.4 and so on, data was
scaled on original dataset to retain quality), only 0.02% of points were
found to be outliers. The GAN architecture and dataset sample is
provided as follows.

<!-- ![GAN architecture](gen.png "GAN architecture")
![Sample of generated datagray - fake, blue - real](dT.png "Sample of generated datagray - fake, blue - real") -->

# General Relations

As we will use dataset of only low power hall thrusters, we can just
ignore derivation of any non-linear equations and relations and use
traditional approach here. Let’s define some parameters of anode:
$$\alpha = \frac{\dot{m}\beta}{{\dot{m}_a}},$$
Where $\alpha$ is anode
parameter of $\beta$ thruster parameter. This is selected because this
way cathode and other losses wont be included in the model. One of key
differences in this approach is fitting only best and most appropriate
data, thus we will eliminate some variance in scaling laws. Though due
to machine learning methods, we would need a lot of information which is
simply not available in those volumes. So some simplifications and
assumptions could be made. Firstly, as it was already said, we don’t
include neutralizer efficiency in the model. Secondly, the model would
be correct on very specific domain, defined by dataset, many parameters
like anode power and $I_{sp}$ still are using semi-empirical modelling
approach. The results we are looking for are outputs of machine learning
algorithm: specific impulse, thrust, efficiency, optimal mass flow rate,
power density. Function of input is solely dependant on power and
voltage range. For the matter of topic let’s introduce semi-empirical
equations which are used for scaling current thrusters.

<div class="longtable*" markdown="2">

$$h=C_hd$$

$$\dot{m_a} = C_m hd$$

$$P_d=C_pU_dd^2$$

$$T=C_t\dot{m_a}\sqrt{U_d}$$

$$I_{spa}=\frac{T}{\dot{m_a} g}$$

$$\eta_a=\frac{T}{2\dot{m_a}P_d}$$

</div>

Where $C_x$ is scaling coefficient obtained from analytical modelling,
which makes equations linear. Generally it has 95% prediction band but
as was said earlier this linearity is what gives problems to current
thrusters designs (high mass, same power density, average performance).
The original dataset is

|          |          |        |       |       |       |              |      |           |
|:---------|:---------|:-------|:------|:------|:------|:-------------|:-----|:----------|
| Thruster | Power, W | U_d, V | d, mm | h, mm | L, mm | m_a,.g/s,    | T, N | I\_spa, s |
| SPT-20   | 52.4     | 180    | 15.0  | 5.0   | 32.0  | 0.47         | 3.9  | 839       |
| SPT-25   | 134      | 180    | 20.0  | 5.0   | 10    | 0.59         | 5.5  | 948       |
| Music-si | 140      | 288    | 18    | 2     | 6.5   | 0.44         | 4.2  | 850       |
| HET-100  | 174      | 300    | 23.5  | 5.5   | 14.5  | 0.50         | 6.8  | 1386      |
| KHT-40   | 187      | 325    | 31.0  | 9.0   | 25.5  | 0.69         | 10.3 | 1519      |
| KHT-50   | 193      | 250    | 42.0  | 8.0   | 25.0  | 0.88         | 11.6 | 1339      |
| HEPS-200 | 195      | 250    | 42.5  | 8.5   | 25.0  | 0.88         | 11.2 | 1300      |
| BHT-200  | 200      | 250    | 21.0  | 5.6   | 11.2  | 0.94         | 12.8 | 1390      |
| KM-32    | 215      | 250    | 32.0  | 7.0   | 16.0  | 1.00         | 12.2 | 1244      |
| ...      |          |        |       |       |       |              |      |           |
| HEPS-500 | 482      | 300    | 49.5  | 15.5  | 25.0  | 1.67         | 25.9 | 1587      |
| UAH-78AM | 520      | 260    | 78.0  | 20    | 40    | 2            | 30   | 1450      |
| BHT-600  | 615      | 300    | 56.0  | 16.0  | 32    | 2.60         | 39.1 | 1530      |
| SPT-70   | 660      | 300    | 56.0  | 14.0  | 25.0  | 2.56         | 40.0 | 1593      |
| MaSMi60  | 700      | 250    | 60    | 9.42  | 19    | 2.56         | 30   | 1300      |
| MaSMiDm  | 1000     | 500    | 67    | 10.5  | 21    | 3            | 53   | 1940      |
| SPT-100  | 1350     | 300    | 85.0  | 15.0  | 25.0  | 5.14         | 81.6 | 1540      |

Hosting only 24 entries in total. The references are as follows(Beal et
al. 2004-11)(Belikov et al. 2001-07-08)(Kronhaus et al. 2013-07)(Misuri
and Andrenucci 2008-07-21)(Lee et al. 2019-11)

In the next section the used neural networks architectures will be
discussed.

# Data driven HET designs

Neural networks are a type of machine learning algorithm that is often
used in the field of artificial intelligence. They are mathematical
models that can be trained to recognize patterns within large datasets.
The architecture of GAN’s generator was already shown. In this section
we will focus on fully connected networks, which are most popular for
type for these tasks. HETFit code leverages dynamic architecture
generation of these FcNN’s which is done via meta learning algorithm
Tree-structured Parzen Estimator for every data input user selects. This
code uses state-of-art implementation made by OPTUNA. The dynamically
suggested architecture has 2 to 6 layers from 4 to 128 nodes on each
with SELU, Tanh or ReLU activations and most optimal optimizer. The code
user interface is as follows: 1. Specify working environment 2. Load or
generate data 3. Tune the architecture 4. Train and get robust scaling
models

## FNN

All of Fully connected neural networks are implemented in PyTorch as it
the most powerful ML/AI library for experiments. When the network
architecture is generated, all of networks have similar training loops
as they use gradient descend algorithm : Loss function:
$$L(w, b) \equiv \frac{1}{2 n} \sum_x\|y(x)-a\|^2$$ This one is mean
square error (MSE) error function most commonly used in FNNs. Next we
iterate while updating weights for a number of specified epochs this
way. Loop for number of epochs:

\- Get predictions: $\hat{y}$

\- Compute loss: $\mathscr{L}(w, b)$

\- Make backward pass

\- Update optimizer

It can be mentioned that dataset of electric propulsion is extremely
complex due to large deviations in data. Thanks to adavnces in data
science and ML it is possible to work with it.

This way we assembled dataset on our ROI domain of $P$\<1000 $W$ input
power and 200-500 $V$ range. Sadly one of limitations of such model is
disability to go beyond actual database limit while not sacrificing
performance and accuracy.

## Physics Informed Neural Networks

For working with unscaled data PINN’s were introduced, they are using
equations 2-7 to generate $C_x$ coefficients. Yes, it was said earlier
that this method lacks ability to generate better performing HETs, but
as we have generated larger dataset on same domain as Lee et al.
(2019-11) it is important to control that our dataset is still the same
quality as original. Using above mentioned PINN’s it was possible to fit
coefficients and they showed only slight divergence in values of few %
which is acceptable.

## ML approach notes

We already have discussed how HETFit code works and results it can
generate, the overiew is going to be given in next section. But here i
want to warn that this work is highly experimental and you should always
take ML approaches with a grain of salt, as some plasma discharge
physics in HET is yet to be understood, data driven way may have some
errors in predictions on specific bands. Few notes on design tool I have
developed in this work: it is meant to be used by people with little to
no experience in ML field but those who wants to quickly analyze their
designs or create baseline one for simulations. One can even use this
tool for general tabular data as it has mostly no limits whatsoever to
input data.

## Two input variables prediction

One of main characteristics for any type of thruster is efficiency, in
this work I researched dependency of multiple input values to $\eta_t$.
Results are as follows in form of predicted matrix visualisations.
Figure 3 takes into account all previous ones in the same time, once
again it would be way harder to do without ML.



# Results discussion

Let’s compare predictions of semi empirical approach(Lee et al.
2019-11), approach in paper(Plyashkov et al. 2022-10-25), and finally
ours. Worth to mention that current approach is easiest to redesign from
scratch.

## NN architecture generation algorithm

As with 50 iterations, previously discussed meta learning model is able
to create architecture with score of 0.9+ in matter of seconds. HETFit
allows logging into neptune.ai environment for full control over
simulations. Example trail run looks like that.



## Power density and magnetic flux dependence

Neither of the models currently support taking magnetic flux in account
besides general physics relations, but we are planning on updating the
model in next follow up paper. For now $\vec{B}$ relation to power
remains unresolved to ML approach but the magnetic field distribution on
z axis is computable and looks like that for magnetically shielded
thrusters:



## Dependency of T on d,P

Following graph is describing Thrust as function of channel diameter and
width, where hue map is thrust. It is well known dependency and it has
few around 95% prediction band (Lee et al. 2019-11)



## Dependency of T on P,U



## Dependency of T on $m_a$,P

Compared to(Shagayda and Gorshkov 2013-03) The model accounts for more
parameters than linear relation. So such method proves to be more
precise on specified domain than semi empirical linear relations.



## Dependency of $I_{sp}$ on d,h



We generated many models so far, but using ML we can make single model
for all of the parameters at the same time, so these graphs tend to be
3d projection of such model inference.

## Use of pretrained model in additive manufacturing of hall effect thruster channels

The above mentioned model was used to predict geometry of channel, next
the simulation was conducted on this channel. Second one for comparison
was calculated via usual scaling laws. The initial conditions for both
are:

| Initial condition | Value             |
|:------------------|:------------------|
| $n_{e,0}$         | 1e13 \[m\^-3\]    |
| $\epsilon_0$      | 4 \[V\]           |
| V                 | 300 \[V\]         |
| T                 | 293.15 \[K\]      |
| P\_abs            | 0.5 \[torr\]      |
| $\mu_e N_n$       | 1e25 \[1/(Vm s)\] |
| dt                | 1e-8 \[s\]        |
| Body              | Ar                |

Outcomes are so that ML geometry results in higher density generation of
ions which leads to more efficient thrust generation. HETFit code
suggests HET parameters by lower estimate to compensate for not included
variables in model of HET. This is experimentally proven to be efficient
estimate since SEM predictions of thrust are always higher than real
performance. Lee et al. (2019-11)



## Code description

Main concepts: - Each observational/design session is called an
environment, for now it can be either RCI or SCI (Real or scaled
interface)

\- Most of the run parameters are specified on this object
initialization, including generation of new samples via GAN

\- Built-in feature generation (log10 Power, efficiency, $\vec{B}$,
etc.)

\- Top feature selection for each case. (Boruta algorithm)

\- Compilation of environment with model of choice, can be any torch
model or sklearn one

\- Training

\- Plot, inference, save, export to jit/onnx, measure performance

## COMSOL HET simulations

The simulations were conducted in COMSOL in plasma physics interface
which gives the ability to accurately compute Electron densities,
temperatures, energy distribution functions from initial conditions and
geometry. Here is comparison of both channels.



# Conclusion

In conclusion the another model of scaling laws was made and presented.
HETFit code is open source and free to be used by anyone. Additively
manufactured channel was printed to prove it’s manufactureability.
Hopefully this work will help developing more modern scaling relations
as current ones are far from perfect.

Method in this paper and firstly used in Plyashkov et al. (2022-10-25)
has advantages over SEM one in: ability to preidct performance more
precisely on given domain, account for experimental data. I believe with
more input data the ML method of deisgning thrusters would be more
widely used.

The code in this work could be used with other tabular experimental data
since most of cases and tasks tend to be the same: feature selection and
model optimization.


<div id="refs" class="references csl-bib-body hanging-indent"
markdown="1">

<div id="ref-beal_plasma_2004" class="csl-entry" markdown="1">

Beal, Brian E., Alec D. Gallimore, James M. Haas, and William A. Hargus.
2004-11. “Plasma Properties in the Plume of a Hall Thruster Cluster.”
*Journal of Propulsion and Power* 20 (6): 985–91.
<https://doi.org/10.2514/1.3765>.

</div>

<div id="ref-belikov_high-performance_2001" class="csl-entry"
markdown="1">

Belikov, M., O. Gorshkov, V. Muravlev, R. Rizakhanov, A. Shagayda, and
A. Snnirev. 2001-07-08. “High-Performance Low Power Hall Thruster.” In
*37th Joint Propulsion Conference and Exhibit*. Salt Lake
City,UT,U.S.A.: American Institute of Aeronautics; Astronautics.
<https://doi.org/10.2514/6.2001-3780>.

</div>

<div id="ref-kronhaus_discharge_2013" class="csl-entry" markdown="1">

Kronhaus, Igal, Alexander Kapulkin, Vladimir Balabanov, Maksim
Rubanovich, Moshe Guelman, and Benveniste Natan. 2013-07. “Discharge
Characterization of the Coaxial Magnetoisolated Longitudinal Anode Hall
Thruster.” *Journal of Propulsion and Power* 29 (4): 938–49.
<https://doi.org/10.2514/1.B34754>.

</div>

<div id="ref-lee_scaling_2019" class="csl-entry" markdown="1">

Lee, Eunkwang, Younho Kim, Hodong Lee, Holak Kim, Guentae Doh, Dongho
Lee, and Wonho Choe. 2019-11. “Scaling Approach for Sub-Kilowatt
Hall-Effect Thrusters.” *Journal of Propulsion and Power* 35 (6):
1073–79. <https://doi.org/10.2514/1.B37424>.

</div>

<div id="ref-misuri_het_2008" class="csl-entry" markdown="1">

Misuri, Tommaso, and Mariano Andrenucci. 2008-07-21. “HET Scaling
Methodology: Improvement and Assessment.” In *44th AIAA/ASME/SAE/ASEE
Joint Propulsion Conference &Amp; Exhibit*. Hartford, CT: American
Institute of Aeronautics; Astronautics.
<https://doi.org/10.2514/6.2008-4806>.

</div>

<div id="ref-plyashkov_scaling_2022" class="csl-entry" markdown="1">

Plyashkov, Yegor V., Andrey A. Shagayda, Dmitrii A. Kravchenko, Fedor D.
Ratnikov, and Alexander S. Lovtsov. 2022-10-25. “On Scaling of
Hall-Effect Thrusters Using Neural Nets,” 2022-10-25.
<http://arxiv.org/abs/2206.04440>.

</div>

<div id="ref-shagayda_hall-thruster_2013" class="csl-entry"
markdown="1">

Shagayda, Andrey A., and Oleg A. Gorshkov. 2013-03. “Hall-Thruster
Scaling Laws.” *Journal of Propulsion and Power* 29 (2): 466–74.
<https://doi.org/10.2514/1.B34650>.

</div>

</div>

[^1]: Founder, Pure EP