Hyeonsu commited on
Commit
97d7ab8
1 Parent(s): 43eb40d

Update draft of model card

Browse files
Files changed (1) hide show
  1. README.md +45 -0
README.md CHANGED
@@ -1,3 +1,48 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ tags:
4
+ - Scene Text Removal
5
+ - Image to Image
6
+ library_name: pytorch
7
  ---
8
+
9
+ ### GaRNet
10
+
11
+ This is text-removal model that introduced in the paper below and first released at [this page](https://github.com/naver/garnet). \
12
+ [The Surprisingly Straightforward Scene Text Removal Method With Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis](https://arxiv.org/abs/2210.07489). \
13
+ Hyeonsu Lee, Chankyu Choi \
14
+ Naver Corp. \
15
+ In ECCV 2022.
16
+
17
+ ### Model description
18
+
19
+ GaRNet is a generator that create non-text image with given image and coresponding text box mask. It consists of convolution encoder and decoder. The encoder consists of residual block with attention module called Gated Attention.
20
+
21
+ Gated Attention module has two Spatial attention branch. Each attention branch finds text stroke or its surrounding regions. The module adjusts the weight of these two domains by trainable parameters.
22
+
23
+ The model was trained in PatchGAN manner with Region-of-Interest Generation. \
24
+ The discriminator is consists of convolution encoder. Given an image, it determines whether each patch, which indicates text-box regions, is real or fake.
25
+ All loss functions treat non-textbox regions as 'don't care'.
26
+
27
+ ### Intended uses & limitations
28
+
29
+ This model can be used for areas that require the process of erasing text from an image, such as concealment private information, text editing.\
30
+ You can use the raw model or pre-trained model.\
31
+ Note that pre-trained model was trained in both Synthetic and SCUT_EnsText dataset. And the SCUT-EnsText dataset can only be used for non-commercial research purposes.
32
+
33
+ ### How to use
34
+
35
+ You can use inference code in [this page](https://github.com/naver/garnet).
36
+
37
+ ### BibTeX entry and citation info
38
+
39
+ ```
40
+ @inproceedings{lee2022surprisingly,
41
+ title={The Surprisingly Straightforward Scene Text Removal Method with Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis},
42
+ author={Lee, Hyeonsu and Choi, Chankyu},
43
+ booktitle={European Conference on Computer Vision},
44
+ pages={457--472},
45
+ year={2022},
46
+ organization={Springer}
47
+ }
48
+ ```