name : diff.cpython-312.pyc
�

ƒ�g�v���ddlZddlmZddlmZddlZddgZ	ddlmZ		eZ
	ed�Zefd�Zd	�Zd
�Zd�Zd�Zd
�Zd�Zd�Zd�ZdDd�Zd�ZGd�d�ZGd�d�ZGd�de�Z d�Z!d�Z"d�Z#d�Z$d�Z%d�Z&Gd�d e
�Z'Gd!�d"e'�Z(Gd#�d$e'�Z)dEd%�Z*dEd&�Z+ejXd'ejZej\z�Z/ejXd(ejZej\z�Z0ejXd)ejZej\z�Z1d*�Z2ejXd+�Z3d,�Z4d-�Z5d.Z6d/Z7d0Z8dDd1�Z9ejXd2ejt�Z;d3�Z<ejXd4�Z=d5�Z>d6�Z?d7�Z@d8�ZAd9�ZBd:�ZCdDd;�ZDd<�ZEd=�ZFd>�ZGd?�ZHGd@�dAej��ZJeKdBk(rddClmLZLeLj��yy#e
$r
ddlmZ	Y���wxYw#e$reZ
Y���wxYw#e$reZY���wxYw)F�N)�etree)�fragment_fromstring�
html_annotate�htmldiff)�escapec�:�dtt|�d��d|�d�S)Nz
<span title="�z">z</span>)�html_escape�_unicode)�text�versions  �?/opt/hc_python/lib64/python3.12/site-packages/lxml/html/diff.py�default_markuprs���H�W�%�q�)�4�1�1�c���|D��cgc]\}}t||���}}}|d}|ddD]}t||�|}�t|�}t||�}dj	|�j�Scc}}w)a
    doclist should be ordered from oldest to newest, like::

        >>> version1 = 'Hello World'
        >>> version2 = 'Goodbye World'
        >>> print(html_annotate([(version1, 'version 1'),
        ...                      (version2, 'version 2')]))
        <span title="version 2">Goodbye</span> <span title="version 1">World</span>

    The documents must be *fragments* (str/UTF8 or unicode), not
    complete documents.

    The markup argument is a function to markup the spans of words.
    This function is called like markup('Hello', 'version 2'), and
    returns HTML.  The first argument is text and never includes any
    markup.  The default uses a span with a title:

        >>> print(default_markup('Some Text', 'by Joe'))
        <span title="by Joe">Some Text</span>
    rr	N�)�tokenize_annotated�html_annotate_merge_annotations�compress_tokens�markup_serialize_tokens�join�strip)�doclist�markup�docr
�	tokenlist�
cur_tokens�tokens�results        rrr"s���6&-�.�%,�\�S�'�$�C��1�%,��.��1��J��A�B�-��'�
�F�;��
� �!��,�J�
$�Z��
8�F�
�7�7�6�?� � �"�"��.s�A2c�<�t|d��}|D]	}||_�|S)zFTokenize a document and add an annotation attribute to each token
    F��
include_hrefs)�tokenize�
annotation)rr$r�toks    rrrJs&���c��
/�F���#�����Mrc��t||��}|j�}|D]$\}}}}}|dk(s�|||}	|||}
t|	|
��&y)z�Merge the annotations from tokens_old into tokens_new, when the
    tokens in the new document already existed in the old document.
    ��a�b�equalN)�InsensitiveSequenceMatcher�get_opcodes�copy_annotations)�
tokens_old�
tokens_new�s�commands�command�i1�i2�j1�j2�eq_old�eq_news           rrrRsZ��	#�Z�:�>�A��}�}��H�#+����R��R��g����2�&�F���2�&�F��V�V�,�	$,rc��t|�t|�k(sJ�t||�D]\}}|j|_�y)zN
    Copy annotations from the tokens listed in src to the tokens in dest
    N)�len�zipr$)�src�dest�src_tok�dest_toks    rr-r-_s=���s�8�s�4�y� � � � ��d�^����%�0�0���,rc���|dg}|ddD]W}|djs5|js)|dj|jk(r
t||��G|j	|��Y|S)zm
    Combine adjacent tokens when there is no HTML between the tokens, 
    and they share an annotation
    rr	N���)�	post_tags�pre_tagsr$�compress_merge_back�append)rrr%s   rrrgsf��
�Q�i�[�F��a�b�z���r�
�$�$�����2�J�!�!�S�^�^�3����,��M�M�#��
��Mrc�R�|d}t|�tust|�tur|j|�yt|�}|jr||jz
}||z
}t||j
|j|j��}|j|_||d<y)zY Merge tok into the last element of tokens (modifying the list of
    tokens in-place).  rA�rCrB�trailing_whitespaceN)�type�tokenrErrHrCrBr$)rr%�lastr�mergeds     rrDrDvs����"�:�D��D�z���$�s�)�5�"8��
�
�c����~���#�#��D�,�,�,�D������t� $�
�
�!$���+.�+B�+B�D��!�O�O�����r�
rc#�K�|D]l}|jEd{���|j�}|||j�}|jr||jz
}|��|jEd{����ny7�]7�	�w)zz
    Serialize the list of tokens into a list of text chunks, calling
    markup_func around text to add annotations.
    N)rC�htmlr$rHrB)r�markup_funcrJrNs    rrr�sp����
���>�>�!�!��z�z�|���4��!1�!1�2���$�$��E�-�-�-�D��
��?�?�"�"��!��	#�s"�A9�A5�AA9�-A7�.A9�7A9c��t|�}t|�}t||�}dj|�j�}t	|�S)a� Do a diff of the old and new document.  The documents are HTML
    *fragments* (str/UTF8 or unicode); they are not complete documents
    (i.e., no <html> tag).

    Returns HTML with <ins> and <del> tags added around the
    appropriate text.  

    Markup is generally ignored, with the markup from new_html
    preserved, and possibly some markup from old_html (though it is
    considered acceptable to lose some of the old markup).  Only the
    words in the HTML are diffed.  The exception is <img> tags, which
    are treated like words, and the href attribute of <a> tags, which
    are noted inside the tag itself when there are changes.
    r)r#�htmldiff_tokensrr�fixup_ins_del_tags)�old_html�new_html�old_html_tokens�new_html_tokensrs     rrr�sE��"�x�(�O��x�(�O�
�_�o�
>�F�
�W�W�V�_�
"�
"�
$�F��f�%�%rc�P�t||��}|j�}g}|D]v\}}}}}	|dk(r |jt|||	d����.|dk(s|dk(rt|||	�}
t	|
|�|dk(s|dk(s�]t|||�}t||��xt
|�}|S)z] Does a diff on the tokens themselves, returning a list of text
    chunks (not tokens).
    r'r*T)r*�insert�replace�delete)r+r,�extend�
expand_tokens�merge_insert�merge_delete�cleanup_delete)�html1_tokens�html2_tokensr0r1rr2r3r4r5r6�
ins_tokens�
del_tokenss            rrQrQ�s���"	#�\�\�B�A��}�}��H�
�F�#+����R��R��g���M�M�-��R��(;�4�H�I���h��'�Y�"6�&�|�B�r�':�;�J���V�,��h��'�Y�"6�&�|�B�r�':�;�J���V�,�$,��F�
#�F��Mrc#�
K�|D]v}|jEd{���|r|js>|jr |j�|jz��n|j���|jEd{����xy7�g7�	�w)zeGiven a list of tokens, return a generator of the chunks of
    text for the data in the tokens.
    N)rC�hide_when_equalrHrNrB)rr*rJs   rr\r\�sl�������>�>�!�!��E�1�1��(�(��j�j�l�U�%>�%>�>�>��j�j�l�"��?�?�"�"��!��	#�s"�B�A?�AB�7B�8B�Bc�T�t|�\}}}|j|�|r!|djd�s
|dxxdz
cc<|jd�|r|djd�r|ddd|d<|j|�|jd�|j|�y)z| doc is the already-handled document (as a list of text chunks);
    here we add <ins>ins_chunks</ins> to the end of that.  rA� z<ins>Nz</ins> )�split_unbalancedr[�endswithrE)�
ins_chunksr�unbalanced_start�balanced�unbalanced_ends     rr]r]�s���2B�*�1M�.��h���J�J�� �
�3�r�7�#�#�C�(�	�B��3����J�J�w���H�R�L�)�)�#�.���|�C�R�(�����J�J�x���J�J�y���J�J�~�rc��eZdZy)�	DEL_STARTN��__name__�
__module__�__qualname__�rrroro����rroc��eZdZy)�DEL_ENDNrprtrrrwrw�rurrwc��eZdZdZy)�	NoDeleteszY Raised when the document no longer contains any pending deletes
    (DEL_START/DEL_END) N)rqrrrs�__doc__rtrrryrys��rryc�z�|jt�|j|�|jt�y)z� Adds the text chunks in del_chunks to the document doc (another
    list of text chunks) with marker to show it is a delete.
    cleanup_delete later resolves these markers into <del> tags.N)rEror[rw)�
del_chunksrs  rr^r^s(���J�J�y���J�J�z���J�J�w�rc��		t|�\}}}t|�\}}}t|||�t	|||�|}|r!|djd�s
|dxxdz
cc<|j
d�|r|djd�r|ddd|d<|j|�|j
d�|j|�|}��#t$rY|SwxYw)a� Cleans up any DEL_START/DEL_END markers in the document, replacing
    them with <del></del>.  To do this while keeping the document
    valid, it may need to drop some tags (either start or end tags).

    It may also move the del into adjacent tags to try to move it to a
    similar location where it was originally located (e.g., moving a
    delete into preceding <div> tag, if the del looks like (DEL_START,
    'Text</div>', DEL_END)rArgz<del>Nz</del> )�split_deleteryrh�locate_unbalanced_start�locate_unbalanced_endrirEr[)�chunks�
pre_deleterZ�post_deleterkrlrmrs        rr_r_
s����	�.:�6�.B�+�J���6F�f�5M�2��(�N�	 � 0�*�k�J��n�j�+�F����s�2�w�'�'��,���G�s�N�G��
�
�7������-�-�c�2�#�B�<���,�H�R�L��
�
�8���
�
�9���
�
�;����7���	��(�M�-	�s�C�	C�Cc��g}g}g}g}|D�]!}|jd�s|j|��'|ddk(}|j�djd�}|tvr|j|��k|r�|r6|dd|k(r+|j|�|j�\}}}	|	||<��|r;|j
|D���	cgc]\}}}	|	��
c}	}}�g}|j|���|j|���|j|t|�|f�|jd���$|j
|D���cgc]\}}}|��
c}}}�|D�cgc]}|��|��	}}|||fScc}	}}wcc}}}wcc}w)a]Return (unbalanced_start, balanced, unbalanced_end), where each is
    a list of text and tag chunks.

    unbalanced_start is a list of all the tags that are opened, but
    not closed in this span.  Similarly, unbalanced_end is a list of
    tags that are closed but were not opened.  Extracting these might
    mean some reordering of the chunks.�<r	�/r�<>/rAN)�
startswithrE�splitr�
empty_tags�popr[r:)
r��start�end�	tag_stackrl�chunk�endtag�name�pos�tags
          rrhrh4s{��
�E�
�C��I��H�������$��O�O�E�"���q��S����{�{�}�Q��%�%�e�,���:���O�O�E�"����Y�r�]�1�-��5�����&�!*������c�3� #���
�����	�B�	�n�d�C��c�	�B�C��	��
�
�5�!��
�
�5�!����d�C��M�5�9�:��O�O�D�!�-�.
�L�L�'0�1�y�#�4��e��y�1�3�#+�A�8�%�u�/@��8�H�A��(�C����C��	2��As�
E.�
E5�E<�#E<c��	|jt�}|jt�}|d|||dz|||dzdfS#t$rt�wxYw)z� Returns (stuff_before_DEL_START, stuff_inside_DEL_START_END,
    stuff_after_DEL_END).  Returns the first case found (there may be
    more DEL_STARTs in stuff_after_DEL_END).  Raises NoDeletes if
    there's no DEL_START found. Nr	)�indexro�
ValueErrorryrw)r�r��pos2s   rr~r~\sc��
��l�l�9�%���<�<�� �D��$�3�<���A��d�+�V�D��F�G�_�<�<�������s�A�Ac��	|sy	|d}|j�djd�}|sy	|d}|tus|jd�sy	|ddk(ry	|j�djd�}|dk(ry	|dk7s
Jd|z��||k(r2|j	d�|j|j	d��ny	��)
a� pre_delete and post_delete implicitly point to a place in the
    document (where the two were split).  This moves that point (by
    popping items from one and pushing them onto the other).  It moves
    the point to try to find a place where unbalanced_start applies.

    As an example::

        >>> unbalanced_start = ['<div>']
        >>> doc = ['<p>', 'Text', '</p>', '<div>', 'More Text', '</div>']
        >>> pre, post = doc[:3], doc[3:]
        >>> pre, post
        (['<p>', 'Text', '</p>'], ['<div>', 'More Text', '</div>'])
        >>> locate_unbalanced_start(unbalanced_start, pre, post)
        >>> pre, post
        (['<p>', 'Text', '</p>', '<div>'], ['More Text', '</div>'])

    As you can see, we moved the point so that the dangling <div> that
    we found will be effectively replaced by the div in the original
    document.  If this doesn't work out, we just throw away
    unbalanced_start without doing anything.
    r	rz<>r�r��ins�delzUnexpected delete tag: %rN)r�rror�r�rE)rkr�r��finding�finding_name�nextr�s       rrrhs���,���"�1�%���}�}��q�)�/�/��5�����1�~���9��D�O�O�C�$8����7�c�>���z�z�|�A��$�$�T�*���5�=���u�}�	0�'�$�.�	0�}��<��� � ��#����k�o�o�a�0�1�
�5rc�f�	|sy|d}|j�djd�}|sy|d}|tus|jd�sy|j�djd�}|dk(s|dk(ry||k(r1|j	�|jd|j	��ny��)zt like locate_unbalanced_start, except handling end tags and
    possibly moving the point earlier in the document.  rArr��</r�r�N)r�rrwr�r�rX)rmr�r�r�r�r�r�s       rr�r��s������ ��$���}�}��q�)�/�/��6�����"�~���7�?�$�/�/�$�"7���z�z�|�A��$�$�U�+���5�=�D�E�M���<����� ����q�*�.�.�"2�3�
�+rc�(�eZdZdZdZdd�Zd�Zd�Zy)rJa8 Represents a diffable token, generally a word that is displayed to
    the user.  Opening tags are attached to this token when they are
    adjacent (pre_tags) and closing tags that follow the word
    (post_tags).  Some exceptions occur when there are empty tags
    adjacent to a word, so there may be close tags in pre_tags, or
    open tags in post_tags.

    We also keep track of whether the word was originally followed by
    whitespace, even though we do not want to treat the word as
    equivalent to a similar word that does not have a trailing
    space.FNc��tj||�}|�||_ng|_|�||_ng|_||_|S�N)r�__new__rCrBrH)�clsrrCrBrH�objs      rr�z
token.__new__�sI�����s�D�)����#�C�L��C�L�� �%�C�M��C�M�"5����
rc	��dtj|��d|j�d|j�d|j�d�	S)Nztoken(�, �))r�__repr__rCrBrH��selfs rr�ztoken.__repr__�s1��*2�*;�*;�D�*A�4�=�=�*.�.�.�$�:R�:R�T�	Trc��t|�Sr�)rr�s rrNz
token.html�s����~�r�NNr)rqrrrsrzrer�r�rNrtrrrJrJ�s��
��O��"T�rrJc�(�eZdZdZ		dd�Zd�Zd�Zy)�	tag_tokenz� Represents a token that is actually a tag.  Currently this is just
    the <img> tag, which takes up visible space just like a word but
    is only represented in a document by a tag.  Nc�v�tj|t�d|��|||��}||_||_||_|S)Nz: rG)rJr�rIr��data�	html_repr)r�r�r�r�rCrBrHr�s        rr�ztag_token.__new__�sD���m�m�C�T�4�!8�%-�&/�0C��E��������!��
��
rc
��d|j�d|j�d|j�d|j�d|j�d|j
�d�
S)Nz
tag_token(r�z, html_repr=z, post_tags=z, pre_tags=z, trailing_whitespace=r�)r�r�r�rCrBrHr�s rr�ztag_token.__repr__�s8���H�H��I�I��N�N��M�M��N�N��$�$�
&�	&rc��|jSr�)r�r�s rrNztag_token.html�s���~�~�rr�)rqrrrsrzr�r�rNrtrrr�r��s��5�59�46�	�&�rr�c��eZdZdZdZd�Zy)�
href_tokenzh Represents the href in an anchor tag.  Unlike other words, we only
    show the href when it changes.  Tc��d|zS)Nz	 Link: %srtr�s rrNzhref_token.htmls
���T�!�!rN)rqrrrsrzrerNrtrrr�r��s��(��O�"rr�c�~�tj|�r|}n
t|d��}t|d|��}t	|�S)ak
    Parses the given HTML and returns token objects (words with attached tags).

    This parses only the content of a page; anything in the head is
    ignored, and the <head> and <body> elements are themselves
    optional.  The content is then parsed by lxml, which ensures the
    validity of the resulting parsed document (though lxml may make
    incorrect guesses when the markup is particularly bad).

    <ins> and <del> tags are also eliminated from the document, as
    that gets confusing.

    If include_hrefs is true, then the href attribute of <a> tags is
    included as a special kind of diffable token.T��cleanup)�skip_tagr")r�	iselement�
parse_html�
flatten_el�fixup_chunks)rNr"�body_elr�s    rr#r#s:��
���t�����T�4�0��
��$�m�
L�F����rc�6�|rt|�}t|d��S)a
    Parses an HTML fragment, returning an lxml element.  Note that the HTML will be
    wrapped in a <div> tag that was not in the original document.

    If cleanup is true, make sure there's no <head> or <body>, and get
    rid of any <ins> and <del> tags.
    T)�
create_parent)�cleanup_htmlr)rNr�s  rr�r�s����D�!���t�4�8�8rz	<body.*?>z
</body.*?>z</?(ins|del).*?>c���tj|�}|r||j�d}tj|�}|r|d|j	�}t
j
d|�}|S)z� This 'cleans' the HTML, meaning that any page structure is removed
    (only the contents of <body> are used, if there is any <body>).
    Also <ins> and <del> tags are removed.  Nr)�_body_re�searchr��_end_body_rer��_ins_del_re�sub)rN�matchs  rr�r�,sa��
�O�O�D�!�E���E�I�I�K�L�!������%�E���N�U�[�[�]�#���?�?�2�t�$�D��Krz
[ \t\n\r]$c�H�t|j��}|d|||dfS)zP
    This function takes a word, such as 'test

' and returns ('test','

')
    rN)r:�rstrip)�word�stripped_lengths  r�split_trailing_whitespacer�<s.���$�+�+�-�(�O���/�"�D��)9�$:�:�:rc
���g}d}g}|D�]-}t|t�rq|ddk(r:|d}t|d�\}}td||||��}g}|j	|�n.|ddk(r&|d}t||d�	�}g}|j	|���t
|�r0t|�\}}t|||�	�}g}|j	|���t|�r|j	|���t|�rF|r|j	|���|sJd
|�d|�d|�d
|����|jj	|���.J�|std|��gS|djj|�|S)zM
    This function takes a list of chunks and produces a list of tokens.
    Nr�imgr	�)r�rCrH�hrefrg)rCrHzWeird state, cur_word=z	, result=z	, chunks=z of r)rCrA)�
isinstance�tupler�r�rEr��is_wordrJ�is_start_tag�
is_end_tagrBr[)	r��	tag_accum�cur_wordrr�r<r�rHr�s	         rr�r�Ds����I��H�
�F����e�U�#��Q�x�5� ��A�h��+D�U�1�X�+N�(��(�$�U�C�3�.7�9L�N���	��
�
�h�'��q��V�#��Q�x��%�d�Y�TW�X���	��
�
�h�'���5�>�)B�5�)I�&�E�&��U�Y�L_�`�H��I��M�M�(�#�
�%�
 ����U�#�
��
��� � ��'��9�����8�9�x��"�"�)�)�%�0��5�I�L��b�9�-�.�.��r�
���#�#�I�.��Mr)
�paramr��area�br�basefont�input�base�meta�link�col)�address�
blockquote�center�dir�div�dl�fieldset�form�h1�h2�h3�h4�h5�h6�hr�isindex�menu�noframes�noscript�ol�p�pre�table�ul)
�dd�dt�frameset�li�tbody�td�tfoot�th�thead�trc#�bK�|s<|jdk(r d|jd�t|�f��n
t|���|jtvr$|jst|�s
|jsyt|j�}|D]}t|����|D]}t||��Ed{����|jdk(r(|jd�r|rd|jd�f��|s7t|���t|j�}|D]}t|����yy7�w�w)a Takes an lxml element el, and generates all the text chunks for
    that tag.  Each start tag is a chunk, each word is a chunk, and each
    end tag is a chunk.

    If skip_tag is true, then the outermost container tag is
    not returned (just its contents).r�r<Nr!r(r�)r��get�	start_tagr�rr:�tail�split_wordsr
r��end_tag)�elr"r��start_wordsr��child�	end_wordss       rr�r��s������
�6�6�U�?��"�&�&��-��2��7�7��B�-��	�v�v���B�G�G�C��G�B�G�G���b�g�g�&�K����$�������e�=�A�A�A��	�v�v��}������M��r�v�v�f�~�&�&���b�k������(�	��D��d�#�#���	B�s�B3D/�5D-�6A8D/z\S+(?:\s+|$)c�X�|r|j�sgStj|�}|S)z_ Splits some text into words. Includes trailing whitespace
    on each word when appropriate.  )r�split_words_re�findall)r�wordss  rr
r
�s)���t�z�z�|��	��"�"�4�(�E��Lrz
^[ \t\n\r]c���d|j�dj|jj�D��cgc]\}}d|�dt	|d��d���c}}��d�Scc}}w)z=
    The text representation of the start tag for a tag.
    r�rrgz="T�"�>)r�r�attrib�itemsr
)rr��values   rrr�s^��
	������,.�I�I�O�O�,=�?�,=�[�T�5�(,�[���-E�F�,=�?�@�A�A��?s�Ac��|jr"tj|j�rd}nd}d|j�d|��S)zg The text representation of an end tag for a tag.  Includes
    trailing whitespace when appropriate.  rgrr�r)r	�start_whitespace_rer�r�)r�extras  rrr�s7��
�w�w�&�-�-�b�g�g�6����������&�&rc�&�|jd�S)Nr��r��r%s rr�r��s���~�~�c�"�"�"rc�$�|jd�S)Nr�rrs rr�r��s���>�>�$��rc�L�|jd�xr|jd�S)Nr�r�rrs rr�r��s"���>�>�#��;�s�~�~�d�';�#;�;rc�P�t|d��}t|�t|d��}|S)z� Given an html string, move any <ins> or <del> tags inside of any
    block-level elements, e.g. transform <ins><p>word</p></ins> to
    <p><ins>word</ins></p> Fr�T)�
skip_outer)r��_fixup_ins_del_tags�serialize_html_fragment)rNrs  rrRrR�s)���T�5�
)�C����"�3�4�8�D��Krc���t|t�r
Jd|z��tj|dt��}|r;||jd�dzd}|d|j
d�}|j�S|S)z� Serialize a single lxml element as HTML.  The serialized form
    includes the element's tail.

    If skip_outer is true, then don't serialize the outermost tag
    z3You should pass in an element, not a string like %rrN)�method�encodingrr	Nr�)r��
basestringr�tostringr�find�rfindr)rr#rNs   rr%r%�sz���"�j�)�D�=��B�D�)��>�>�"�V�h�?�D���D�I�I�c�N�1�$�%�&���$�T�Z�Z��_�%���z�z�|���rc��dD]D}|jd|z�D]+}t|�s�t||��|j��-�Fy)z?fixup_ins_del_tags that works on an lxml document in-place
    )r�r�zdescendant-or-self::%s)r�N)�xpath�_contains_block_level_tag�_move_el_inside_block�drop_tag)rr�rs   rr$r$sE�����)�)�4�s�:�;�B�,�R�0��!�"�#�.��K�K�M�	<�rc�v�|jtvs|jtvry|D]}t|�s�yy)zPTrue if the element contains any block-level elements, like <p>, <td>, etc.
    TF)r��block_level_tags�block_level_container_tagsr/)rrs  rr/r/s:��
�v�v�!�!�R�V�V�/I�%I����$�U�+���rc���|D]}t|�s�nOtj|�}|j|_d|_|j	t|��|g|ddyt|�D]�}t|�rkt
||�|js�'tj|�}|j|_d|_|j|j|�dz|��ytj|�}|j||�|j|���|jr@tj|�}|j|_d|_|jd|�yy)zt helper for _fixup_ins_del_tags; actually takes the <ins> etc tags
    and moves them inside any block-level tags.  Nr	r)r/r�Elementrr[�listr0r	rXr�rYrE)rr�r�children_tag�tail_tag�	child_tag�text_tags       rr0r0s$����$�U�+���
�}�}�S�)���G�G���������D��H�%����1����b���$�U�+�!�%��-��z�z� �=�=��-�� %�
�
��
�!��
��	�	�"�(�(�5�/�!�+�X�6��
�
�c�*�I��J�J�u�i�(����U�#��
�w�w��=�=��%������
����
�	�	�!�X��	rc�:�|j�}|jxsd}|jrat|�s||jz
}nF|djr#|dxj|jz
c_n|j|d_|j	|�}|re|dk(rd}n||dz
}|�*|jr|xj|z
c_n1||_n)|jr|xj|z
c_n||_|j�|||dzy)z�
    Removes an element, but merges its contents into its place, e.g.,
    given <p>Hi <i>there!</i></p>, if you remove the <i> element you get
    <p>Hi there!</p>
    rrArNr	)�	getparentrr	r:r��getchildren)r�parentrr��previouss     r�_merge_element_contentsrA9s����\�\�^�F�
�7�7�=�b�D�	�w�w��2�w��B�G�G�O�D��"�v�{�{��2����r�w�w�&�� �g�g��2����L�L���E���A�:��H��e�A�g��H����{�{����t�#��"����}�}��
�
��%�
� $��
��N�N�,�F�5��q��rc��eZdZdZdZd�Zy)r+zt
    Acts like SequenceMatcher, but tries not to find very small equal
    blocks amidst large spans of changes
    r�c��tt|j�t|j��}t|j|dz�}tj
j
|�}|D�cgc]}|d|kDs|ds|��c}Scc}w)N�r�)�minr:r)�	threshold�difflib�SequenceMatcher�get_matching_blocks)r��sizerF�actual�items     rrIz.InsensitiveSequenceMatcher.get_matching_blockscs~���3�t�v�v�;��D�F�F��,�������q��1�	��(�(�<�<�T�B��!'� �����7�Y�&��A�w��� �	 �� s�/BN)rqrrrsrzrFrIrtrrr+r+[s���
�I� rr+�__main__)�_diffcommand)F)T)NrG�lxmlr�	lxml.htmlr�re�__all__rNrr
�ImportError�cgi�unicoder�	NameError�strr)rrrrr-rrDrrrQr\r]rorw�	Exceptionryr^r_rhr~rr�rJr�r�r#r��compile�I�Sr�r�r�r��end_whitespace_rer�r�r�r3r4r��Urr
rrrr�r�r�rRr%r$r/r0rArHr+rqrN�mainrtrr�<module>r_sd����)�	��J�
'��*�*���H���1�#1�&#�P�-�1�
��$#�&&�.$�L#��.	�	�	�	��	���%�N& �P
=�0�d�4'�H�'�R���8"��"� �09��2�:�:�l�B�D�D����I�.���r�z�z�-����b�d�d��3���b�j�j�,�b�d�d�2�4�4�i�8����B�J�J�}�-��;�2�l#�
���6��$�6����O�R�T�T�2���!�b�j�j��/��A�'�#� �<���$���@ -�D ��!8�!8� � �z��&��L������}�*�)�)�*�����H���
���J��s3�G�G#�G1�G �G �#G.�-G.�1G<�;G<
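The docstrings in this module describe a word-level diff that wraps inserted text in <ins> and deleted text in <del>, driven by a SequenceMatcher variant that ignores very small equal blocks inside large changes. The following is a minimal sketch of that idea using only the standard library. It is not the module's implementation: it works on plain text split on whitespace rather than on parsed HTML tokens, the function name `word_diff` is hypothetical, and the threshold value of 2 is an assumption, since it is not legible in the bytecode above.

```python
import difflib
from html import escape


class InsensitiveSequenceMatcher(difflib.SequenceMatcher):
    """SequenceMatcher that drops very small equal blocks.

    Sketch of the behaviour described in the docstring above; the
    threshold of 2 is an assumed value.
    """

    threshold = 2  # assumption: not readable in the dumped bytecode

    def get_matching_blocks(self):
        # Scale the cut-off down for short inputs so tiny documents
        # can still produce matches.
        size = min(len(self.a), len(self.b))
        threshold = min(self.threshold, size / 4)
        actual = difflib.SequenceMatcher.get_matching_blocks(self)
        # Keep blocks longer than the cut-off, plus the zero-length
        # sentinel block that terminates the list.
        return [block for block in actual
                if block[2] > threshold or not block[2]]


def word_diff(old, new):
    """Hypothetical helper: diff two plain-text strings word by word."""
    old_words = old.split()
    new_words = new.split()
    matcher = InsensitiveSequenceMatcher(a=old_words, b=new_words)
    out = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            out.append(escape(" ".join(new_words[j1:j2])))
            continue
        # A "replace" is emitted as an insertion followed by a deletion.
        if tag in ("insert", "replace"):
            out.append("<ins>" + escape(" ".join(new_words[j1:j2])) + "</ins>")
        if tag in ("delete", "replace"):
            out.append("<del>" + escape(" ".join(old_words[i1:i2])) + "</del>")
    return " ".join(out)


print(word_diff("Hello World", "Goodbye World"))
```

With two long, mostly unrelated inputs that share a single word, the filtered matcher discards that one-word match, so the whole span is reported as one insertion and one deletion instead of being fragmented around the shared word.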