How to get textContent including childNodes?

Question

I have some plain text content in paragraphs inside a <main> HTML element. The paragraphs are separated by new lines (\n), not in <p> tags, and I would like to automatically wrap them in <p> tags using JavaScript. Example content: Inside of <main> there may be <img> elements. I wa…

Accepted Answer

To retain non-text content like images, you’ll need to process the text nodes of the main element rather than using textContent, since textContent gives you only the element’s text and discards everything else.

Assuming you only want to do this with the text nodes that are direct children of the element, you can loop through the element’s child nodes, split each text node on line breaks, and, if you get more than one segment, insert a paragraph for each segment. Something like this (see inline comments):

    function convertLineBreaksToParagraphs(element) {
        // Get a snapshot of the child nodes of the element; we want
        // a snapshot because we may change the element's contents
        const nodes = [...element.childNodes];
        // Loop through the snapshot
        for (const node of nodes) {
            // Is this a text node?
            if (node.nodeType === Node.TEXT_NODE) {
                // Yes, split it on line breaks
                const parts = node.nodeValue.split(/\r\n|\r|\n/);
                // Did we find any?
                if (parts.length > 1) {
                    // Yes, loop through the "paragraphs"
                    for (const part of parts) {
                        // Create an actual paragraph for it
                        const p = document.createElement("p");
                        p.textContent = part;
                        // Insert it in front of the text node it came from
                        element.insertBefore(p, node);
                    }
                    // Remove the text node we've replaced with paragraphs
                    element.removeChild(node);
                }
            }
        }
    }

    convertLineBreaksToParagraphs(document.querySelector("main"));

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum. Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.
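
The split step in the function above can be tried on its own, without a DOM. This is a minimal sketch: splitOnLineBreaks and the sample string are hypothetical names used only for illustration. It shows why the pattern lists \r\n first, and why a result with a single segment means the text node contained no line break.

```javascript
// Hypothetical helper (not part of the answer): split a string on
// CRLF, lone CR, or lone LF, using the same pattern as the loop above.
function splitOnLineBreaks(text) {
  // \r\n comes first so a Windows line ending counts as one break, not two
  return text.split(/\r\n|\r|\n/);
}

const sample = "First paragraph\r\nSecond paragraph\nThird paragraph";
const parts = splitOnLineBreaks(sample);

console.log(parts.length); // 3
console.log(parts); // [ 'First paragraph', 'Second paragraph', 'Third paragraph' ]

// A string without line breaks comes back as a single segment,
// which is the case the `parts.length > 1` check skips.
console.log(splitOnLineBreaks("No breaks here").length); // 1
```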

You may need to tweak that a bit, depending on how you want to handle that image just before the fifth paragraph. The above leaves the image outside the paragraph. But if you wanted it to be inside the paragraph, you could add some logic to do that:

    function convertLineBreaksToParagraphs(element) {
        // The image seen immediately before the current node, if any
        let img = null;
        // Get a snapshot of the child nodes of the element; we want
        // a snapshot because we may change the element's contents
        const nodes = [...element.childNodes];
        // Loop through the snapshot
        for (const node of nodes) {
            // Is this a text node?
            if (node.nodeType === Node.TEXT_NODE) {
                // Yes, split it on line breaks
                const parts = node.nodeValue.split(/\r\n|\r|\n/);
                // Did we find any?
                if (parts.length > 1) {
                    // Yes, loop through the "paragraphs"
                    for (const part of parts) {
                        // Create an actual paragraph for it
                        const p = document.createElement("p");
                        p.textContent = part;
                        // If we *just* saw an image before this text node,
                        // move it to the start of the paragraph
                        if (img) {
                            p.insertBefore(img, p.firstChild);
                            img = null;
                        }
                        // Insert it in front of the text node it came from
                        element.insertBefore(p, node);
                    }
                    // Remove the text node we've replaced with paragraphs
                    element.removeChild(node);
                }
            } else if (node.nodeName === "IMG") {
                // Remember the image so the next text node can adopt it
                img = node;
            } else {
                img = null;
            }
        }
    }

    convertLineBreaksToParagraphs(document.querySelector("main"));
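
Another tweak that may be worth considering: text nodes in real markup often start or end with a line break, so a plain split yields empty strings, and the loop above would create an empty paragraph for each of them. A minimal sketch, assuming blank segments should be skipped and surrounding whitespace trimmed (nonBlankSegments is a hypothetical helper, not part of the answer):

```javascript
// Hypothetical helper: split on line breaks, trim each segment,
// and drop the segments that end up empty.
function nonBlankSegments(text) {
  return text
    .split(/\r\n|\r|\n/)
    .map((part) => part.trim())
    .filter((part) => part.length > 0);
}

const text = "\n  First paragraph\n\n  Second paragraph\n";
console.log(nonBlankSegments(text));
// [ 'First paragraph', 'Second paragraph' ]
```

If you swap this in for the plain split, the `parts.length > 1` check no longer means "this node contained a line break", so you would need to decide separately whether a single remaining segment should also be wrapped.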

You may have people telling you to do this by manipulating the HTML from innerHTML, but the problem with doing that is you run the risk of introducing <p> tags in the middle of a tag (and you will remove any event handlers attached to elements inside main when you set its innerHTML). For instance, if you have:

    <img
        src="/path/to/something">

you’d end up with

    <p><img</p>
    <p>    src="/path/to/something"></p>

…which is obviously not good.
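
The failure mode can be reproduced with plain strings, no DOM needed. This is a sketch of the naive approach, shown only to illustrate the problem: naiveWrapLines and the sample markup are hypothetical, and the function is not something to use on real HTML.

```javascript
// Hypothetical naive approach: wrap every line of an HTML string
// in <p>...</p>, ignoring whether the break falls inside a tag.
function naiveWrapLines(html) {
  return html
    .split(/\r\n|\r|\n/)
    .map((line) => `<p>${line}</p>`)
    .join("\n");
}

// The img tag has a line break between its name and its attribute
const html = 'Some text\n<img\n    src="/path/to/something">';

console.log(naiveWrapLines(html));
// <p>Some text</p>
// <p><img</p>
// <p>    src="/path/to/something"></p>
```

The img tag is cut in two by the inserted paragraph tags, which is why working on text nodes is the safer route.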
