关于xslt：当标志重复且不连续时如何在标志之间选择节点

How to select nodes between flags when flags repeat and are not consecutive

给定输入文档是一系列相同级别的节点，我想找到出现在两个标志(它们本身就是节点)之间的那些节点。标志可以多次使用，最终结果应该将相同标志之间的所有内容组合在一起。我在这方面表现出色。

给定这个输入文档：

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25

<root>
Hello world 1.
Hello world 2.
Hello world 3.
Dummy text 
Hello world 4.
Hello world 5.
Hello world 6.
Dummy text 
Hello world 7.
Dummy text 
Hello world 8.
Dummy text 
Hello world 9.
Dummy text for starting a new excerpt 
Hello world 10.
Hello world 11.
Dummy text 
Hello world 12.
Hello world 13.
Hello world 14.
Hello world 15.
Hello world 16.
Hello world 17.
</root>

我想要这个输出：

1
2
3
4
5
6
7
8
9
10
11
12

<root>
Dummy text
Hello world 4.
Hello world 5.
Hello world 6.
Hello world 10.
Hello world 11.
Dummy text
Dummy text
Hello world 8.
Dummy text
</root>

注意：标志总是以 "excerptstart" 和 "excerptend" 开头，并且标志的后缀将始终匹配(也就是说，商业规则保证总会有一个 "excerptendone"，如果有一个"excerptstartone").

这是我目前所拥有的。只要我对摘录开始后缀进行硬编码(即\\'one\\'、\\'two\\')，我就可以找到我想要的集合。我坚持试图概括它，因此后缀不必硬编码(我还应该说我不关心在结果树中保留开始/结束段落"标志"；我\\为了方便评估结果树，已将它们硬编码在此处)：

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
version="2.0">
<xsl:template match="root">
<root>
Dummy text
<xsl:for-each select="p[@class='excerptstartone']">
<xsl:sequence select="following-sibling::node() intersect following-sibling::p[@class='excerptendone'][1]/preceding-sibling::node()"/>
</xsl:for-each>
Dummy text
Dummy text
<xsl:for-each select="p[@class='excerptstarttwo']">
<xsl:sequence select="following-sibling::node() intersect following-sibling::p[@class='excerptendtwo'][1]/preceding-sibling::node()"/>
</xsl:for-each>
Dummy text
</root>
</xsl:template>
<xsl:template match="text()"/>
</xsl:stylesheet>

看看例如这种 Kayessian 方法。

或者试试这个：

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:output omit-xml-declaration="yes" indent="yes"/>
<xsl:key name="kFollowing" match="p"
use="generate-id(preceding-sibling::p[starts-with(@class, 'excerptstart')][1])"/>

<xsl:key name="kExcerptstart" match="p[starts-with(@class, 'excerptstart')]" use="@class"/>

<xsl:template match="/*">
<xsl:copy>
<xsl:apply-templates select="p"/>
</xsl:copy>
</xsl:template>

<xsl:template match="p" />
<xsl:template match="p[ generate-id() = generate-id( key( 'kExcerptstart', @class)[1])]">
<xsl:copy-of select="."/>
<xsl:variable name="start" select="@class" />
<xsl:for-each select=" key( 'kExcerptstart', $start)">
<xsl:variable name="end" select="following-sibling::p[starts-with(@class, 'excerptend')][1]"/>
<xsl:variable name="ns1" select="following-sibling::*" />
<xsl:variable name="ns2" select="$end/preceding-sibling::*" />

<xsl:copy-of select="$ns1[count(.|$ns2) = count($ns2)]"/>
</xsl:for-each>
<xsl:copy-of select="following-sibling::p[starts-with(@class, 'excerptend')][1]"/>
</xsl:template>
</xsl:stylesheet>

这将产生以下输出：

1
2
3
4
5
6
7
8
9
10
11
12