Here's a string. I want to remove the C-style comments, including the comment text itself, without using regex.
    a = "word234 /*12aaa12*/"
I want the output to be just:
word234
Answer
Here is a simple algorithm (in Python) that keeps state across two consecutive characters and uses a flag to decide whether each character is kept.
    a = "word234 /*12aaa12*/ word123 /*xx*xx*/ end"

    out = []      # characters kept so far
    add = True    # False while inside a comment
    prev = None   # previous character
    for c in a:
        if c == '*' and prev == '/':
            if add:
                del out[-1]   # drop the '/' that was already kept
            add = False
        if c == '/' and prev == '*':
            add = True
            prev = c
            continue          # skip the closing '/'
        prev = c
        if add:
            out.append(c)
    s2 = ''.join(out)
    print(s2)
Output:
    word234  word123  end

Note the doubled spaces: the space on each side of a removed comment is kept.
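Since the question asks for just `word234`, here is a minimal sketch that wraps the same flag idea in a function and then tidies the leftover whitespace with `split`/`join`, still without regex. The name `strip_comments` is made up for illustration. This variant also resets `prev` after an opener or closer, which is my own tweak so that `/*/` is not read as a complete comment; it does not try to handle `/*` inside string literals.

```python
def strip_comments(text: str) -> str:
    """Remove /* ... */ comments with a two-character state machine."""
    out = []      # characters kept so far
    keep = True   # False while inside a comment
    prev = None   # previous character, or None right after an opener/closer
    for c in text:
        if keep and c == '*' and prev == '/':
            out.pop()       # drop the '/' that was already kept
            keep = False
            prev = None     # assumption: "/*/" should not close the comment
            continue
        if not keep and c == '/' and prev == '*':
            keep = True
            prev = None     # assumption: "*/*" should not reopen a comment
            continue
        prev = c
        if keep:
            out.append(c)
    return ''.join(out)


def squeeze_spaces(text: str) -> str:
    """Collapse runs of whitespace and trim both ends, without regex."""
    return ' '.join(text.split())


a = "word234 /*12aaa12*/"
print(squeeze_spaces(strip_comments(a)))
```

Whether to squeeze whitespace is a judgment call: it matches the output asked for in the question, but it would also collapse spacing that was intentional in the original string.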
If you want to handle nested comments (standard C does not nest block comments, but this is fun to do), the algorithm is easy to modify: replace the flag with a counter that tracks the nesting depth:
    a = "word234 /*12aaa12*/ word123 /*xx/*yy*/xx*/ end"

    out = []
    lvl = 0       # current nesting depth; 0 means outside any comment
    prev = None
    for c in a:
        if c == '*' and prev == '/':
            if lvl == 0:
                del out[-1]   # drop the '/' that was already kept
            lvl += 1          # entering a comment
        if c == '/' and prev == '*':
            lvl -= 1          # leaving a comment
            prev = c
            continue
        prev = c
        if lvl == 0:
            out.append(c)
    s2 = ''.join(out)
    print(s2)
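The depth-counting idea can also be sketched as a reusable function. The name `strip_nested_comments` is hypothetical, and two behaviors here are my own assumptions rather than part of the answer above: a stray `*/` outside any comment is left in the output untouched, and `prev` is reset after each opener or closer so the two-character patterns cannot overlap.

```python
def strip_nested_comments(text: str) -> str:
    """Remove /* ... */ comments, allowing them to nest."""
    out = []
    depth = 0     # 0 means outside any comment
    prev = None   # previous character, or None right after an opener/closer
    for c in text:
        if c == '*' and prev == '/':
            if depth == 0:
                out.pop()   # drop the '/' that was already kept
            depth += 1
            prev = None
            continue
        if depth > 0 and c == '/' and prev == '*':
            depth -= 1
            prev = None
            continue
        prev = c
        if depth == 0:
            out.append(c)
    return ''.join(out)


a = "word234 /*12aaa12*/ word123 /*xx/*yy*/xx*/ end"
print(strip_nested_comments(a))
```

As with the flag version, the spaces around each removed comment are kept, so the result contains doubled spaces unless they are collapsed afterwards.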