[LeetCode] 1410. HTML Entity Parser

HTML entity parser is the parser that takes HTML code as input and replace all the entities of the special characters by the characters itself.

The special characters and their entities for HTML are:
Quotation Mark: the entity is " and symbol character is “.
Single Quote Mark: the entity is ' and symbol character is ‘.
Ampersand: the entity is & and symbol character is &.
Greater Than Sign: the entity is > and symbol character is >.
Less Than Sign: the entity is < and symbol character is <.
Slash: the entity is ⁄ and symbol character is /.
Given the input text string to the HTML parser, you have to implement the entity parser.

Return the text after replacing the entities by the special characters.

Example 1:
Input: text = “& is an HTML entity but &ambassador; is not.”
Output: “& is an HTML entity but &ambassador; is not.”
Explanation: The parser will replace the & entity by &

Example 2:
Input: text = “and I quote: "…"”
Output: “and I quote: "…"“

Constraints:
1 <= text.length <= 105
The string may contain any possible characters out of all the 256 ASCII characters.

HTML 实体解析器。

「HTML 实体解析器」 是一种特殊的解析器,它将 HTML 代码作为输入,并用字符本身替换掉所有这些特殊的字符实体。

HTML 里这些特殊字符和它们对应的字符实体包括:
双引号:字符实体为 " ,对应的字符是 “ 。
单引号:字符实体为 ' ,对应的字符是 ‘ 。
与符号:字符实体为 & ,对应对的字符是 & 。
大于号:字符实体为 > ,对应的字符是 > 。
小于号:字符实体为 < ,对应的字符是 < 。
斜线号:字符实体为 ⁄ ,对应的字符是 / 。
给你输入字符串 text ,请你实现一个 HTML 实体解析器,返回解析器解析后的结果。

思路

用java自带的string.replace()函数做,记得把对&符号的判断放在最后。否则如下这个 case 会有错。
Input => “&gt;”
如果不把 &amp; 放到最后判断,这个 input 会被判定成 &gt;,即>

复杂度

时间O(n)
空间O(1)

代码

Java实现

1
2
3
4
5
class Solution {
public String entityParser(String text) {
return text.replace("&quot;", "\"").replace("&apos;", "'").replace("&gt;", ">").replace("&lt;", "<").replace("&frasl;", "/").replace("&amp;", "&");
}
}

[LeetCode] 1410. HTML Entity Parser
https://shurui91.github.io/posts/2208021994.html
Author
Aaron Liu
Posted on
November 22, 2023
Licensed under