Sentence Break Chart (original) (raw)
Unicode Version: 4.1.0
Date: 2005-03-29, 01:31:34 GMT
Sep | Format | Sp | Lower | Upper | OLetter | Numeric | ATerm | STerm | Close | Other | |
---|---|---|---|---|---|---|---|---|---|---|---|
Sep | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ |
Format | × | × | × | × | × | × | × | × | × | × | × |
Sp | × | × | × | × | × | × | × | × | × | × | × |
Lower | × | × | × | × | × | × | × | × | × | × | × |
Upper | × | × | × | × | × | × | × | × | × | × | × |
OLetter | × | × | × | × | × | × | × | × | × | × | × |
Numeric | × | × | × | × | × | × | × | × | × | × | × |
ATerm | × | × | × | × | × | ÷ | × | ÷ | ÷ | × | ÷ |
STerm | × | × | × | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | × | ÷ |
Close | × | × | × | × | × | × | × | × | × | × | × |
Other | × | × | × | × | × | × | × | × | × | × | × |
Rules
- 1: sot ÷
- 2: ÷ eot
- 3: Sep ÷
- 4: GC -> FC
- 5: X Format* -> X
- 6: ATerm × ( Numeric | Lower )
- 7: Upper ATerm × Upper
- 8: ATerm Close* Sp* × ( ¬(OLetter | Upper | Lower) )* Lower
- 9: ( Term | ATerm ) Close* × ( Close | Sp | Sep )
- 10: ( Term | ATerm ) Close* Sp × ( Sp | Sep )
- 11: ( Term | ATerm ) Close* Sp* ÷
- 12: Any × Any
Sample Strings
- ( " G o . " ) ( H e d i d . )
- ( “ G o ? ” ) ( H e d i d . )
- U . S . A ◌̀ . i s
- U . S . A ◌̀ ? H e
- U . S . A ◌̀ .
- 3 . 4
- c . d
- e t c . ) ’ ‘ ( t h e
- e t c . ) ’ ‘ ( T h e
- t h e r e s p . l e a d e r s a r e
- 字 . 字
- e t c . 它
- e t c . 。
- 字 。 它
- □ ( □ " □ G □ o □ . □ " □ ) □ □ ( □ H □ e □ □ d □ i □ d □ . □ ) □ □
- □ ( □ “ □ G □ o □ ? □ ” □ ) □ □ ( □ H □ e □ □ d □ i □ d □ . □ ) □ □
- □ U □ . □ S □ . □ A □ ◌̀ . □ □ i □ s □ □
- □ U □ . □ S □ . □ A □ ◌̀ ? □ □ H □ e □ □
- □ U □ . □ S □ . □ A □ ◌̀ . □ □
- □ 3 □ . □ 4 □ □
- □ c □ . □ d □ □
- □ e □ t □ c □ . □ ) □ ’ □ □ ‘ □ ( □ t □ h □ e □ □
- □ e □ t □ c □ . □ ) □ ’ □ □ ‘ □ ( □ T □ h □ e □ □
- □ t □ h □ e □ □ r □ e □ s □ p □ . □ □ l □ e □ a □ d □ e □ r □ s □ □ a □ r □ e □ □
- □ 字 □ . □ 字 □ □
- □ e □ t □ c □ . □ 它 □ □
- □ e □ t □ c □ . □ 。 □ □
- □ 字 □ 。 □ 它 □ □